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TITLE 

SUPPRESSION OF SPECIFIC CLASSES 
OF SOYBEAN SEED PROTEIN GENES 
FIELD OF THE INVENTION 
5 This invention concerns the construction of transgenic soybean lines 

wherein the expression of genes encoding seed storage proteins is modified to 
effect a change in seed storage protein profile of transgenic plants. Such modified 
transgenic soybean lines are used for the production of novel soy protein products 
with unique and valuable functional characteristics. 
10 BACKGROUND OF THE INVENTION 

Soybean seeds contain from 35% to 55% protein on a dry weight basis. 
The majority of this protein is storage protein, which is hydrolyzed during 
germination to provide energy and metabolic intermediates needed by the 
developing seedling. The soybean seed's storage protein is an important 
15 nutritional source when harvested and utilized as a livestock feed. In addition, it 
is now generally recognized that soybeans are the most economical source of 
protein for human consumption. Soy protein or protein isolates are already used 
extensively for food products in different parts of the world. Much effort has been 
devoted to improving the quantity and quality of the storage protein in soybean 
20 seeds. 

The seeds of most plant species contain what are known in the art as seed 
storage proteins. These have been classified on the basis of their size and 
solubility (Higgins, T. J. (1984) Ann. Rev. Plant Physiol. 55:191-221). While not 
every class is found in every species* the seeds of most plant species contain 

25 proteins from more than one class. Proteins within a particular solubility or size 
class are generally more structurally related to members of the same class in other 
species than to members of a different class within the same species. In many 
species, the seed proteins of a given class are often encoded by multigene families, 
sometimes of such complexity that the families can be divided into subclasses 

30 based on sequence homology. 

There are two major soybean seed storage proteinsiglycinin (also known as 
the 1 IS globulins) and p-conglycinin (also known as the 7S globulins). Together, 
they comprise 70 to 80% of the seed's total protein, or 25 to 35% of the seed's dry 
weight. Glycinin is a large protein with a molecular weight of about 360 kDa. It 

35 is a hexamer composed of the various combinations of five major isoforms 

(commonly called subunits) identified as GK G2, G3, G4 and G5. Each subunit is 
in turn composed of one acidic and one basic polypeptide held together by a 
disulfide bond. Both the acidic and basic polypeptides of a single subunit are 
coded for by a single gene. Hence, there are five non-alleiic genes that code for 
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the five glycinin subunits. These genes are designated Gyl, Gy2, Gy3, Gy4 and 
Gy5, corresponding to subunits Gl, G2, G3, G4 and G5, respectively (Nielsen, N. 
C. etal. (1989) Plant Cell 7:313-328). 

Genomic clones and cDNA's for glycinin subunit genes have been 
5 sequenced and fall into two groups based on nucleotide and amino acid sequence 
similarity. Group I consists of Gyl, Gy2, and Gy3, whereas Group II consists of 
Gy4 and Gy5. There is greater than 85% similarity between genes within a group 
(i.e., at least 85% of the nucleotides of Gyl, Gy2 and Gy3 are identical, and at 
least 85% of the nucleotides of Gy4 and Gy5 are identical), but only 42% to 46% 

10 similarity between the genes of Group I and Group II. 

(5-Conglycinin (a 7S globulin) is a heterogeneous glycoprotein with a 
molecular weight ranging from 150 and 240 kDa. It is composed of varying 
combinations of three highly negatively charged subunits identified as a, a 4 and p< 
cDNA clones representing the coding regions of the genes encoding the the a and 

15 a 1 subunits have been sequenced and are of similar size; sequence identity is 

limited to 85%. The sequence of the cDNA representing the coding region of the 
(5 subunit, however, is nearly 0.5 kb smaller than the cc and a' cDNAs. Excluding 
this deletion, sequence identity to the a and ct' subunits is 75-80%. The three 
classes of |3-conglycinin subunits are encoded by a total of 1 5 subunit genes 

20 clustered in several regions within the genome soybean (Harada, J. J. et al. (1989) 
Plant Cell 7:415-425). 

New soy based products such as protein concentrates, isolates, and 
textured protein products are increasingly utilized in countries that do not 
necessarily accept traditional oriental soy based foods. Use of these new products 

25 in food applications, however, depends on local tastes and functional 

characteristic of the protein products relative to recipe requirements. Over the 
past 10 years, significant effort has been aimed at understanding the functional 
characteristics of soybean proteins. Examples of functional characteristics include 
water sorption parameters, wettability, swelling, water holding, solubility, 

30 thickening, viscosity, coagulation, gelation characteristics and emulsification 
properties, A large portion of this body of research has focused on study of the 
P-conglycinin and glycinin proteins individually, as well as how each of these 
proteins influences the soy protein system as a whole (Kinsella, J. E. et aL (1985) 
New Protein Foods 5:107-179; Morr, C. V. (1987) JAOCS 67:265-271; Peng, L, 

35 C. et al (1984) Cereal Chem 67:480-489). Because functional properties are 

directly related to physiochemical properties of proteins, the structural differences 
of {i-conglycinin and glycinin result in these two proteins having significantly 
different functional characteristics. Differences in thermal aggregation, 
emulsifying properties, and water holding capacity have been reported. In 

2 
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addition, gelling properties vary as welh with glycinin forming gels that have 
greater tensile strain, stress, and shear strength, better solvent holding capacity, 
and lower turbidity. However, soy protein products produced today are a blend of 
both glycinin and P-conglycinin and therefore have functional characteristics 
5 dependent on the blend of glycinin's and P-conglycinin's individual 

characteristics. For example, when glycinin is heated to 100°C, about 50% of the 
protein is rapidly converted into soluble aggregates. Further heating results in the 
enlargement of the aggregates and in their precipitation. The precipitate consists 
of the glycinin's basic polypeptides; the acidic polypeptides remain soluble. The 

1 0 presence of p-conglycinin inhibits the precipitation of the basic polypeptides by 
forming soluble complexes with them. Whether heat denaturation is desireable or 
not depends on the intended use. If one could produce soy protein products 
containing just one or the other storage protein, products requiring specific 
physical characteristics derived from particular soy proteins would become 

1 5 available or would be more economical to produce. 

Over the past 20 years, soybean lines lacking one or more of the various 
storage protein subunits (null mutations) have been identified in the soybean 
germplasm or produced using mutational breeding techniques. Breeding efforts to 
combine mutational events have resulted in soybean lines whose seeds contain 

20 about half the normal amount of p-conglycinin (Takashashi, K. et al. (1994) 

Breeding Science 44:65-66\ Kitamura, J. ( 1 995) JARQ 29: 1-8). The reduction of 
p-conglycinin is controlled by three independent recessive mutations. 
Recombining glycinin subunit null mutations have resulted in lines whose seeds 
have significantly reduced amounts of glycinin (Kitamura, J, (1995) JARQ 

25 29: 1 -8), Again, reduction is controlled by three independent recessive mutations. 
Developing agronomically viable soybean varieties from the above lines, in which 
the seed contains only glycinin or p-conglycinin, will be time consuming and 
costly. Each cross will result in the independent segregation of the three 
mutational events. In addition, each mutational event will need to be in the 

30 homozygous state. Development of high yielding agronomically superior soybean 
lines will require the screening and analysis of a large number of progeny over 
numerous generations. 

Antisense technology has been used to reduce specific storage proteins in 
seeds. In Brassica napus, napin (a 2S albumin) and cruciferin (an 1 IS globulin) 

35 are the two major storage proteins, comprising about 25% and 60% of the total 
seeds protein, respectively. Napin proteins are coded for by a large multi-gene 
family of up to 16 genes; several cDNA and genomic clones have been sequenced 
(Josefsson, L.~G. et al. (1987) J Biol Chern 262:12196-12201; Schofield, S. and 
Crouch, M. L. (1987) J BioL Chem. 252:12202-12208). The genes exhibit 

3 
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greater than 90% sequence identity in both their coding and flanking regions. The 
cruciferin gene family is equally complex, comprising 3 subfamilies with a total 
of 8 genes (Rodin, J. et aL (1992) Plant Mol Biol 20:559-563). Kohno-Murase et 
aL ((1994) Plant Mol Biol 26: 1 115-1 124) demonstrated that a napin antisense 
5 gene using the nap A gene driven by the nap A promoter could be used to construct 
transgenic plants whose seeds contained little or no napin. 

The same group (Kohno-Murase et aL (1995) Theoret Applied Genetics 
97:627-63 1) attempted to reduce cruciferin (1 IS globulin) expression in Brassica 
napus by expressing an antisense form of a cruciferin gene (cruA, encoding an 
1 0 alpha 2/3 isoform) under the control of the napA promoter. In this case the results 
were more complex. The cruciferins are divided into three subclasses based on 
sequence identity (alpha I, 2/3, and 4); the classes each have from 60-75% 
sequence identity with each other (Rodin, J. et aL (1992) Plant Mol. Biol. 
20:559-563). Expression of the antisense gene encoding the alpha 2/3 isoform 
15 resulted in lower levels of the alpha 1 and 2/3 forms. However, there was no 
reduction in the expression of the alpha 4 class. 

Antisense technology was used to reduce the level of the seed storage 
protein, glutelin, in rice. Expression of the seed specific glutelin promoter 
operably linked to the full length antisense glutelin coding region resulted in about 
20 a 25% reduction in glutelin protein levels (U.S. Patent No. 5,516,668). 

SUMMARY OF THE INVENTION 
The instant invention provides a method for reducing the quantity glycinin 
or (5-conglycinin (1 IS or 7S globulins, respectively) seed storage proteins in 
soybeans. In one embodiment, cosuppression technology was used to suppress 
25 the expression of genes encoding the 7S-gIobulin class of seed protein genes. 
Genes encoding either two (a and a') or all three subclasses (a, a' and p) of 7S 
globulins were suppressed by expression of the gene encoding a single subclass 
(a) of p-conglycinin, resulting in soybean lines with altered seed storage profiles. 
In another embodiment, a method for supressing two completely different genes, 
30 only one of which is a seed protein gene, is presented, allowing for multiple 
changes in seed composition. Surprisingly, expression of a chimeric gene 
comprising the promoter region of a soybean seed storage protein operably linked 
to the coding region of a soybean gene whose expression alters the fatty acid 
profile of transgenic soybean seeds resulted in simultaneous alteration of two 
35 distinct phenotypic traits: seed storage protein profile and seed oil profile. 

The method for reducing the quantity of soybean seed storage protein 
taught herein comprises the following steps: 

(a) constructing a chimeric gene comprising (i) a nucleic acid 
fragment encoding a promoter that is functional in the cells of 
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soybean seeds, (ii) a nucleic acid fragment encoding all or a 
portion of a soybean seed storage protein placed in sense or 
antisense orientation relative to the promoter of (i), and (iii) a 
transcriptional termination region; 
5 (b) creating a transgenic soybean cell by introducing into a soybean 

cell the chimeric gene of (a); and 
(c) growing the transgenic soybean cells of step (b) under conditions 
that result in expression of the chimeric gene of step (a) 
wherein the quantity of one or more members of a class of soybean seed storage 
10 protein subunits is reduced when compared to soybeans not containing the 
chimeric gene of step (a). 

DETAILED DESCRIPTION OF THE INVENTION 
BRIEF DESCRIPTION OF THE SEQUENCE DESCRIPTIONS 
The invention can be more fully understood from the following detailed 
15 description and the Sequence Descriptions which form a part of this application. 
The Sequence Descriptions contain the three letter codes for amino acids as 
defined in 37 C.F.R. 1 .822 which are incorporated herein by reference. 

SEQ ID NO: 1 shows the 5 r to 3' nucleotide sequence encoding the a 
subunit of the p-conglycinin soybean seed storage protein. 
20 SEQ ID NO:2 shows the 5* to 3* nucleotide sequence encoding the a 1 

subunit of the P-conglycinin soybean seed storage protein, 

SEQ ID NO: 3 shows the 5' to 3* nucleotide sequence encoding the p 
subunit of the p-conglycinin soybean seed storage protein. 

SEQ ID NOS:4 and 5 show the nucleotide sequences of the PCR primers 
25 ConS and Con 1 -4a (respectively) used to isolate nucleic acid fragments encoding 
the a and a 1 subunits of the p-conglycinin soybean seed storage protein. 

SEQ ID NOS:6 and 7 show nucleotide sequences of the PCR primers 
Con.09 and Con.8 (respectively) used to distinguish nucleic acid fragments 
encoding the a and a' subunits of the P-conglycinin soybean seed storage protein. 
30 SEQ ID NOS:8 and 9 show the nucleotide sequences of the PCR primers 

ConSa and Conl .9a (respectively) used to isolate full length cDNAs encoding the 
a and a' subunits of the p-conglycinin soybean seed storage protein. 

SEQ ID NO: 10 shows the nucleotide sequence of the PCR primer Con. 1.0 
used to confirm the full length cDNA encoding the a and a T subunits of the 
35 p-conglycinin soybean seed storage protein. 

SEQ ID NOS:l 1, 12 and 13 show the 5' to 3' nucleotide sequences 
encoding the Gy 1 , Gy2 and Gy3 subunits (respectively) of the group I glycinin 
soybean seed storage protein. 
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SEQ ID NOS:14 and 15 show the 5' to 3' nucleotide sequences encoding 
the Gy4 and Gy5 subunits (respectively) of the group II glycinin soybean seed 
storage protein. 

SEQ ID NOS: 16, 17 and 18 show the nucleotide sequences of the PCR 
5 primers G 1 - 1 , G 1 - 1 039 and G 1 - 1 475 (respectively) used to isolate the cDNAs 
encoding the subunits of the group I glycinin soybean seed storage protein, 

SEQ ID NOS: 19, 20 and 21 show the nucleotide sequences of the PCR 
primers G4-7, G4-1251, and G4-1670 (respectively) used to isolate the cDNA 
encoding the subunits of the group II glycinin soybean seed storage protein. 
10 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a restriction map of plasmid pML70, used as an intermediate 
cloning vehicle in construction of chimeric genes of the instant invention. 

Figure 2 is a restriction map of plasmid pCW109, used as an intermediate 
cloning vehicle in construction of chimeric genes of the instant invention. 
1 5 Figure 3 is a restriction map of plasmid pKS 1 8HH, used as an intermediate 

cloning vehicle in construction of chimeric genes of the instant invention. 

Figure 4 is a restriction map of plasmid pJol . This plasmid was derived 
by cloning the plant transcriptional unit KTi promoter/truncated a subunit of 
{3-eonglycinin/KTi 3' end into the BamH I site of pKS18HH. 
20 Figure 5 is an SDS-PAGE gel of extracted protein from somatic embryos 

transformed with pJol, 

Figure 6 is a restriction map of plasmid pBS43. This plasmid comprises a 
nucleic acid sequence encoding the Glycine max microsomal delta-12 desaturase 
under the transcriptional control of the soybean p-conglycinin promoter. 
25 Figure 7 is an SDS-PAGE gel of extracted protein from soybean seeds 

obtained from plants transformed with pBS43. 

Figure 8 is a restriction map of plasmid pJo3. This plasmid was derived 
by cloning the plant transcriptional unit KTi promoter/full length cDNA of the a 
subunit of P-conglycinin/KTi 3* end into the Hindlll site of pKS18HR 
30 Figure 9 is a restriction map of plasmid pRB20. This plasmid was derived 

by cloning the transcriptional unit p-conglycinin promoter/Phaseolin 3' end into 
the Hindlll site of pKS 1 8HFL It is used as an intermediate cloning vehicle in 
construction of chimeric genes of the instant invention, 

BIOLOGICAL DEPOSITS 
35 The following plasmids have been deposited under the terms of the 

Budapest Treaty at American Type Culture Collection (ATCC), 12301 Parklawn 
Drive, Rockville, MD 20852, and bear the following accession numbers: 



6 
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Plasmid Accession Number Date of Deposit 

pJol ATCC 97614 June 15. 1996 

pBS43 ATCC 976 1 9 June 1 9, 1 996 

P Jo3 ATCC 9761 5 June 15, 1996 



DEFINITIONS 

In the context of this disclosure, a number of terms shall be used. The 
term "nucleic acid" refers to a large molecule which can be single- stranded or 
5 double-stranded, composed of monomers (nucleotides) containing a sugar, a 
phosphate and either a purine or pyrimidine. A "nucleic acid fragment" is a 
fraction of a given nucleic acid molecule. In higher plants, deoxyribonucleic acid 
(DNA) is the genetic material while ribonucleic acid (RNA) is involved in the 
transfer of the information in DNA into proteins. A "genome" is the entire body 
1 0 of genetic material contained in each cell of an organism. The term "nucleotide 
sequence" refers to the sequence of DNA or RNA polymers, which can be single- 
or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases capable of incorporation into DNA or RNA polymers. 

As used herein, the term "homologous to" refers to the relatedness 
1 5 between the nucleotide sequence of two nucleic acid molecules or between the 
amino acid sequences of two protein molecules. Estimates of such homology are 
provided by either DNA-DNA or DNA-RNA hybridization under conditions of 
stringency as is well understood by those skilled in the art (Hames and Higgins, 
Eds. (1985) Nucleic Acid Hybridisation, IRL Press, Oxford, U.K.); or by the 
20 comparison of sequence similarity between two nucleic acids or proteins, such as 
by the method of Needleman et al. ((1970) J. Mol Biol 4£:443-453). 

As used herein, "essentially similar" refers to DNA sequences that may 
involve base changes that do not cause a change in the encoded amino acid, or 
which involve base changes which may alter one or more amino acids, but do not 
25 affect the functional properties of the protein encoded by the DNA sequence. It is 
therefore understood that the invention encompasses more than the specific 
exemplary sequences. Modifications to the sequence, such as deletions, 
insertions, or substitutions in the sequence which produce silent changes that do 
not substantially affect the functional properties of the resulting protein molecule 
30 are also contemplated. For example, alteration in the gene sequence which reflect 
the degeneracy of the genetic code, or which results in the production of a 
chemically equivalent amino acid at a given site, are contemplated; thus, a codon 
for the amino acid alanine, a hydrophobic amino acid, may be substituted by a 
codon encoding another hydrophobic amino acid residue such as glycine, valine, 
35 leucine, or isoleucine. Similarly, changes which result in substitution of one 

7 
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negatively charged residue for another, such as aspartic acid for glutamic acid, or 
one positively charged residue for another, such as lysine for arginine, can also be 
expected to produce a biologically equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the protein 
5 molecule would also not be expected to alter the activity of the protein. In some 
cases, it may in fact be desirable to make mutants of the sequence in order to study 
the effect of alteration on the biological activity of the protein. Each of the 
proposed modifications is well within the routine skill in the art, as is 
determination of retention of biological activity of the encoded products. 
10 Moreover, the skilled artisan recognizes that "essentially similar" sequences 

encompassed by this invention can also defined by their ability to hybridize, under 
stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the sequences 
exemplified herein. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 
1 5 including regulatory sequences preceding (5* non-coding) and following (3 1 non- 
coding) the coding region, "Native" gene refers to an isolated gene with its own 
regulatory sequences as found in nature. "Chimeric gene" refers to a gene that 
comprises heterogeneous regulatory and coding sequences not found in nature. 
"Endogenous" gene refers to the native gene normally found in its natural 
20 location in the genome and is not isolated, A "foreign" gene refers to a gene not 
normally found in the host organism but that is introduced by gene transfer, 

"Coding sequence" or "coding region" refers to a DNA sequence that 
codes for a specific protein and excludes the non-coding sequences. It may 
constitute an "uninterrupted coding sequence", i.e., lacking an intron or it may 
25 include one or more introns bounded by appropriate splice junctions. An "intron" 
is a nucleotide sequence that is transcribed in the primary transcript but that is 
removed through cleavage and re-ligation of the RNA within the cell to create the 
mature mRNA that can be translated into a protein. 

"Initiation codon" and "termination codon" refer to a unit of three 
30 adjacent nucleotides in a coding sequence that specifies initiation and chain 
termination, respectively, of protein synthesis (mRNA translation), "Open 
reading frame" refers to the coding sequence uninterrupted by introns between 
initiation and termination codons that encodes an amino acid sequence. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
35 catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary 
transcript or it may be a RNA sequence derived from posttranscriptional 
processing of the primary transcript and is referred to as the mature RNA, 
"Messenger RNA (mRNA)" refers to the RNA that is without introns and that 
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can be translated into protein by the cell. "cDNA" refers to a double-stranded 
DNA that is complementary to and derived from mRNA. "Sense" RNA refers to 
RNA transcript that includes the mRNA, "Antisense RNA" refers to a RNA 
transcript that is complementary to all or part of a target primary transcript or 
5 mRNA and that blocks the expression of a target gene. The complementarity of 
an antisense RNA may be with any part of the specific gene transcript, i.e., at the 
5* non-coding sequence, 3' non-coding sequence, introns, or the coding sequence. 

As used herein, "suitable regulatory sequences" refer to nucleotide 
sequences in native or chimeric genes that are located upstream (5'), within, or 

10 downstream (3 1 ) to the nucleic acid fragments of the invention* which control the 
expression of the nucleic acid fragments of the invention. The term "expression", 
as used herein, refers to the transcription and stable accumulation of the sense 
(mRNA) or the antisense RNA derived from the nucleic acid fragment(s) of the 
invention that, in conjunction with the protein apparatus of the cell, results in 

15 altered phenotypic traits. Expression of the gene involves transcription of the 

gene and translation of the mRNA into precursor or mature proteins, "Antisense 
inhibition" refers to the production of antisense RNA transcripts capable of 
preventing the expression of the target protein. "Overexpression" refers to the 
production of a gene product in transgenic organisms that exceeds levels of 

20 production in normal or non-transformed organisms. "Cosuppression" refers to 
the expression of a foreign gene which has substantial homology to an 
endogenous gene resulting in the suppression of expression of both the foreign 
and the endogenous gene. "Altered levels" refers to the production of gene 
product(s) in transgenic organisms in amounts or proportions that differ from that 

25 of normal or non-transformed organisms. The skilled artisan will recognize that 
the phenotypic effects contemplated by this invention can be achieved by 
alteration of the level of gene product(s) produced in transgenic organisms relative 
to normal or non-transformed organisms, namely a reduction in gene expression 
mediated by antisense suppression or cosuppression. 

30 "Promoter" refers to a DNA sequence in a gene, usually upstream (5') to 

its coding sequence, which controls the expression of the coding sequence by 
providing the recognition for RNA polymerase and other factors required for 
proper transcription. In artificial DNA constructs, promoters can also be used to 
transcribe antisense RNA. Promoters may also contain DNA sequences that are 

35 involved in the binding of protein factors which control the effectiveness of 

transcription initiation in response to physiological or developmental conditions. 
It may also contain enhancer elements. An "enhancer" is a DNA sequence which 
can stimulate promoter activity. It may be an innate element of the promoter or a 
heterologous element inserted to enhance the level or tissue-specificity of a 
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promoter, k * Constitutive promoters*' refers to those that direct gene expression in 
all tissues and at all times. "Tissue-specific^ or "development-specific" 
promoters as referred to herein are those that direct gene expression almost 
exclusively in specific tissues, such as leaves or seeds, or at specific development 
5 stages in a tissue, such as in early or late embryogenesis, respectively. 

The "3* non-coding sequences" refers to the DNA sequence portion of a 
gene that contains a polyadenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. The polyadenylation 
signal is usually characterized by affecting the addition of polyadenylic acid tracts 

10 to the 3* end of the mRNA precursor. 

The term "operably linked" refers to nucleic acid sequences on a single 
nucleic acid molecule which are associated so that the function of one is affected 
by the other. For example, a promoter is operably linked with a structural gene 
when it is capable of affecting the expression of that structural gene (i.e., that the 

15 structural gene is under the transcriptional control of the promoter). 

"Transformation" refers to the transfer of a nucleic acid fragment into the 
genome of a host organism, resulting in genetically stable inheritence. Host 
organisms containing the transformed nucleic acid fragments are referred to as 
"transgenic" organisms. 

20 This invention concerns the construction of transgenic soybean lines 

wherein the expression of genes encoding seed storage proteins are modulated to 
effect a change in seed storage protein profile of transgenic plants. Modification 
of the seed storage protein profile can result in production of novel soy protein 
products with unique and valuable functional characteristics, 

25 Gene expression in plants uses regulatory sequences that are functional in 

such plants. The expression of foreign genes in plants is well-established 
(De Blaere et al. (1987) Meth. Ertzymol 753:277-291). The source of the 
promoter chosen to drive the expression of the fragments of the invention is not 
critical provided it has sufficient transcriptional activity to accomplish the 

30 invention by decreasing the expression of the target seed storage protein genes* 
Preferred promoters include strong constitutive plant promoters, such as those 
directing the 19S and 35S transcripts in cauliflower mosaic virus (Odell, J. T. et 
al. (1985) Nature J7J:810~812; Hull et aL (1987) Virology 5(5:482-493). 
Particularly preferred promoters are those that allow seed-specific expression. 

35 Examples of seed-specific promoters include, but are not limited to, the promoters 
of seed storage proteins, which can represent up to 90% of total seed protein in 
many plants. The seed storage proteins are strictly regulated, being expressed 
almost exclusively in seeds in a highly tissue-specific and stage-specific manner 
(Higgins et aL (1984) Arm. Rev. Plant Physiol 55:191-221; Goldberg et al. (1989) 

10 
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Cell 56: 149-160). Moreover, different seed storage proteins may be expressed at 

different stages of seed development. 

Expression of seed-specific genes has been studied in great detail (See 
reviews by Goldberg et al. (1 989) Cell 56: 149-160 and Higgins et aL (1984) Ann, 
5 Rev. Plant Physiol. 55:191 -22 1 ), There are currently numerous examples of seed- 
specific expression of seed storage protein genes (natural or chimeric) in 
transgenic dicotyledonous plants; in general, temporal and spatial expression 
patterns are maintained. The promoters used in such examples could potentially 
be used to affect the present invention. These include genes from dicotyledonous 

1 0 plants for bean p-phaseolin (Sengupta-Gopalan et aL(l985) Proc. Natl Acad. ScL 
USA 52:3320-3324; Hoffman et al. (1988) Plant Mol Biol 77:717-729), bean 
lectin (Voelker et al (1987) EMBO J, (5:3571-3577), soybean lectin (Okamuro et 
al. (1986) Proc, Natl Acad. Set USA 55:8240-8244), soybean Kunitz trypsin 
inhibitor (Perez-Grau et al (1989) Plant Cell 7:095-1109), soybean p-conglycinin 

15 (Beachy et al. (1985) EMBO X 4:3047-3053; pea vicilin (Higgins et aL (1 988) 
Plant Mol Biol 77:683-695), pea convicilin (Newbigin et al. (1990) Planta 
750:461-470), pea legumin (Shirsat et al. (1989) Mol Gen. Genetics 
275:326-331), rapeseed napin (Radke et aL (1988) Theor Appl GeneL 
75:685-694) and Arabidopsis thaliana 2S albumin (Vandekerckhove et al (1989) 

20 Bio/Technology 7:929-932). 

Of particular use in the expression of the nucleic acid fragment of the 
invention will be the heterologous promoters from several soybean seed storage 
protein genes such as those for the Kunitz trypsin inhibitor (KTi; Jofuku et aL 
(1989) Plant Cell 7:1079-1093; glycinin (Nielson et al. (1989) Plant Cell 

25 7 :3 1 3^328), and p-conglycinin (Harada et al. (1989) Plant Cell 1 :4 1 5-425). The 
skilled artisan will recognize that attention must be paid to differences in temporal 
regulation endowed by different seed promoters. For example, the promoter for 
the a-subunit gene is expressed a few days before that for the p-subunit gene 
(Beachy et al. (1985) EMBO J. 4:3047-3053), so that the use of the P-subunit gene 

30 is likely to be less useful for suppressing a-subunit expression. 

Also of potential use, but less preferred, will be the promoters of genes 
involved in other aspects of seed metabolism, such as lipid or carbohydrate 
biosynthesis. In summary, the skilled artisan will have no difficulty in 
recognizing that any promoter of sufficient strength and appropriate temporal 

35 expression pattern can potentially be used to implement the present invention. 
Similarly, the introduction of enhancers or enhancer-like elements into the 
promoter regions of either the native or chimeric nucleic acid fragments of the 
invention would result in increased expression to accomplish the invention. This 
would include vira! enhancers such as that found in the 35S promoter (Odell et al. 
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(1988) Plant MoL Biol / 0:263-272). enhancers from the opine genes (Fromm et 
al. (1989) Plant Cell 7:977-984), or enhancers from any other source that result in 
increased transcription when placed into a promoter operably linked to the nucleic 
acid fragment of the invention. 
5 Of particular importance is the DNA sequence element isolated from the 

gene encoding the a-subunit of f5-conglycinin that can confer a 40-fold, seed- 
specific enhancement to a constitutive promoter (Chen et al. (1989) Dev. Genet 
70:1 12-122). One skilled in the art can readily isolate this element and insert it 
within the promoter region of any gene in order to obtain seed-specific enhanced 

10 expression with the promoter in transgenic plants. Insertion of such an element in 
any seed-specific gene that is normally expressed at times different than the 
p-conglycinin gene will result in expression of that gene in transgenic plants for a 
longer period during seed development. 

Any y non-coding region capable of providing a polyadenylation signal 

1 5 and other regulatory sequences that may be required for the proper expression of 
the nucleic acid fragments of the invention can be used to accomplish the 
invention. This would include 3 1 ends of the native fatty acid desaturase(s), viral 
genes such as from the 35S or the 19S cauliflower mosaic virus transcripts, from 
the opine synthesis genes, ribulose 1 ,5-bisphosphate carboxylase, or chlorophyll 

20 a/b binding protein. There are numerous examples in the art that teach the 
usefulness of different 3' non-coding regions- 
Various methods of transforming cells of higher plants according to the 
present invention are available to those skilled in the art (see European Patent 
Publications EP-A-295,959 and EP-A-3 1 8,341). Such methods include those 

25 based on transformation vectors utilizing the Ti and Ri ptasmids of 

Agrobacterium spp. It is particularly preferred to use the binary type of these 
vectors. Ti -derived vectors transform a wide variety of higher plants, including 
monocotyledonous and dicotyledonous plants (Sukhapinda et al. (1987) Plant 
MoL Biol. £:209-216; Potrykus, (1985) MoL Gen, GeneL 799:183). Other 

30 transformation methods are available to those skilled in the art, such as direct 
uptake of foreign DNA constructs (see European Patent Publication 
EP-A-295,959), techniques of electroporation (Fromm et al. (1986) Nature 
(London) 3 J 9:791) or high-velocity ballistic bombardment with metal particles 
coated with the nucleic acid constructs (Klein et aL (1987) Nature (London) 

35 327:70). Once transformed, the cells can be regenerated by those skilled in the 
art. Of particular relevance are the recently described methods to transform 
soybean, including McCabe et al. ((1988) Bio/Technology tf:923-926), Finer et aL 
((1991) In Vitro CelL Dev. BioL 27:175-182) and Hinchee, M.A.W, ((1988) 
Bio/Technology 6:9 1 5-922). 

12 
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Once transgenic plants are obtained by one of the methods described 
above, it is necessary to screen individual transgenics for those that most 
effectively display the desired phenotype. It is well known to those skilled in the 
art that individual transgenic plants carrying the same construct may differ in 
5 expression levels; this phenomenon is commonly referred to as "position effect". 
Thus, in the present invention different individual transformants may vary in the 
effectiveness of suppression of the target seed protein. The person skilled in the 
art will know that special considerations are associated with the use of antisense or 
cosuppresion technologies in order to reduce expression of particular genes. U.S. 
10 Patent Nos. 5,190,931, 5,107,065 and 5,283,323 have taught the feasibility of 
these techniques, but it is well known that their efficiency is unpredictable. 
Accordingly, the person skilled in the art will make multiple genetic constructs 
containing one or more different parts of the gene to be suppressed, since the art 
does not teach a method to predict which will be most effective for a particular 
15 gene. Furthermore, even the most effective constructs will give an effective 
suppression phenotype only in a fraction of the individual transgenic lines 
isolated. For example, World Patent Publications W093/11245 and W094/1 1516 
teach that when attempting to suppress the expression of fatty acid desaturase 
genes in canola, actual suppression was obtained in less than 1% of the lines 
20 tested. In other species the percentage is somewhat higher, but in no case does the 
percentage reach 100. This should not be seen as a limitation on the present 
invention, but instead as practical matter that is appreciated and anticipated by the 
person skilled in this art. Accordingly, the skilled artisan will develop methods 
for screening large numbers of transformants. The nature of these screens will 
25 generally be chosen on practical grounds, and is not an inherent part of the 
invention. A preferred method will be one which allows large numbers of 
samples to be processed rapidly, since it will be expected that the majority of 
samples will be negative. 

The mechanism of cosuppression remains unclear (for one review and 
30 speculation, see Flavell, R. (1994) Proc. Nail Acad. Sci. USA 97:3490-3496), and 
therefore the exact requirments to induce it when desired are also unclear. Most 
examples found in the literature involve the use of all or a large part of the 
transcribed region of the gene to be cosuppressed to elicit the desired response. 
However, in at least one case (Brusslan et al. (1993) Plant Cell 5:667-677; 
35 Brusslan and Tobin (1995) Plant Mol Biol 27:809-813), that of the cabl40 gene 
of Arabidopsis, the use of the promoter (as a 1.3 kb fragment) and just 14 bp of 
transcribed region fused to a completely unrelated gene was sufficient to result in 
cosuppression of the endogenous cab 140 gene as well as the introduced chimeric 
gene. This result is unusual and apparantly quite unpredictable, as numerous other 
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promoter-leader (the 5' untranslated leader being defined as the region between the 
start of transcription and the translation initiation codon) units have been used to 
drive chimeric genes successfully. Flavell speculates that some or many genes 
(including members of multigene families such as those encoding seed proteins) 
5 may have evolved so as to avoid the mechanisms of cosuppression, while others 
have not, providing a potential further level of regulation as genomes evolve. 
Thus, the instant observation that the promoter and leader of the conglycinin gene 
can be used to suppress expression of endogenous conglycins while the other 
portion of the transgene (beyond the initiation codon) can be used to suppress a 
10 completely unrelated gene is unique. 

EXAMPLES 

The present invention is further defined by the following examples. It will 
be understood that the examples are given for illustration only and the present 
invention is not limited to uses described in the examples. The present invention 

15 can be used to generate transgenic soybean plants with altered levels of various 

seed storage proteins. From the above discussion and the following examples, one 
skilled in the art can ascertain, and without departing from the spirit and scope 
thereof, can make various changes and modifications of the invention to adapt it to 
various usages and conditions. AH such modifications are intended to fall within 

20 the scope of the intended claims. 

Detailed procedures for DNA manipulation, such as use of restriction 
endonuclease enzymes, other modifying enzymes, agarose gel electrophoresis, 
nucleic acid hybridization, and transformation of E. coli with plasmid DNA are 
described in Sambrook et aL (1989) Molecular Cloning, A Laboratory manual, 

25 2nd ed, Cold Spring Harbor Laboratory Press (hereinafter "Maniatis"). All 

restriction enzymes and other modifying enzymes were obtained from Gibco BRL 
(Gaithersburg, MD). 

EXAMPLE 1 

To determine whether the expression of (J-conglycinin in developing 
30 soybean cotyledons could be the target of cosuppression, truncated cDNA 

fragments of the a and a' subunits of (3-conglycinin were prepared using a reverse 
transcriptase polymerase chain reaction kit (Geneamp™ RNA PCR Kit; Perkin 
Elmer Cetus). The upper primer, ConS, is homologous to nucleotides 5-19 of the 
a and a' subunit cDNA sequences obtained from the EMBL/GenBank/DDBJ 
35 databases. To aid cloning, additional nucleotides were added to the 5* end to code 
for an Nco I restriction site. The lower primer, Con 1 ,4a, is complementary to 
nucleotides 1370-1354 of SEQ ID NO:l and 1472-1456 of SEQ ID NO:2, 
representing the sequences of the a and a 1 cDNAs, respectively. To aid in 
cloning, additional nucleotides were added to the 5' end to introduce a Kpn I 
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The nucleotide sequences of PCR primers ConS and ConL4a are 



ConS 5 1 -CGT ACCATGGTG AGAGCGCGGTTCC-3 ' (SEQ ID NO:4) 
5 Nco I 

Con 1.4a 5 ' - C GGTACC G A ATTGAAGTGTGGT AG-3 ' (SEQ ID NO:5) 
Kpn I 

RNA isolated from developing soybean seeds was reverse-transcribed 

10 using either the kit-supplied random hexamers, or Conl. 4a, following the 

manufacturer's protocol. The resulting cDNA fragments were amplified in the 
PCR (Polymerase Chain Reaction) reaction using a mixture of ConS and Conl .4a. 
Reactant concentrations were as described in the manufacturer's protocols. The 
following program was used: a) one cycle of 2 minutes at 95°C; b) 35 cycles of: 

15 1 .5 minutes at 50°C (annealing), 5 minutes at 70°C (extension), 1 .5 minutes at 
95°C (denaturation); and c) one cycle of 2 minutes at 50°C followed by 
10 minutes at 68°C. Fifteen microliters of each of the PCR reaction mixes was 
analyzed by agarose gel electrophoresis. Reactions resulted in PCR products of 
the expected sizes: 1.47 kb for a' and 1 .37 kb for a. The truncated cDNA 

20 fragments from the remainder of the reaction mixes were purified using the 
Wizard™ PCR Preps DNA Purification System kit (Promega). 

The purified reaction mix containing the a and a' fragments, which 
because of the primers used, included Nco I restriction sites at the 5' ends and 
Kpn I restriction sites at the 3' ends, were digested with Kpn I and Nco I 

25 restriction enzymes. The a cDNA fragment was recovered following gel 
electrophoresis, designated as fragment F8, and directional ly cloned (sense 
orientation) into pCWI09 (Fig. 1) and pML70 (Fig. 2) using the Nco I to Kpn I 
sites present in both plasmids. F8 was confirmed as a by PCR using a nested set 
of primers (Con,09 and Con.8) internal to ConS and Conl .4a, and distinguished 

30 from a' by digestion of pCW109/F8 plasmid with Hind III, Nco I, Kpn I, and Pst I 
(a does not contain a Pst I site whereas a' does). 

Con,09 5 r -TCGTCCATGGAGCGCGGTTCCCATTAC-3' (SEQ IDNO:6) 

35 Con.8 5-TCTCGGTCGTCGTTGTT-3 ' (SEQ ID NO:7) 

The transcriptional unit KTi promoter/truncated a/KTi 3 f end was released 
from plasmid pML70/F8 by restriction digest with BamHl, gel isolated, and 
labeled asFIL Fl 1 was then cloned into pKS18HH (Fig. 3) at the BamH I site. 
40 pKS18HH is a plasmid construction containing the following genetic elements: 
(i) T7 promoter/Hygromycin B Phosphotransferase (HPT)/T7 Terminator 

15 
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Sequence; (ii) 35S promoter from cauliflower mosaic virus (CaMV)/Hygromycin 
B Phosphotransferase (HPT)/Nopatine Synthase (NOS) 3' from Agrobacterium 
tumefaciens T-DNA; and (iii) pSP72 piasmid vector (Promega) with beta- 
lactamase coding region removed. One skilled in the art of molecular biology can 
5 ligate the above three components into a single piasmid vector using well known 
protocols (Maniatis). 

The Hygromycin B Phosphotransferase (HPT) gene was isolated by PCR 
amplification from E. coli strain W677 containing a Klebsiella-derived piasmid 
pJR225 (Gritz L., and Davies J. (1 983) Gene 25: 1 79- 1 88). pKS 1 8HH contains 

10 the CaMV 35S/HPT/NOS cassette for constitutive expression of the HPT enzyme 
in plants, such as soybean. The pKS 1 8HH piasmid also contains the T7 
promoter/HPT/T7 terminator cassette for expression of the HPT enzyme in certain 
strains of E. coli, such as NovaBlue™ (DE3) (Novagen) that are lysogenic for 
lambda DE3 (which carries the T7 RNA Polymerase gene under lacUVS control). 

1 5 pKS 1 8HH also contains three unique restriction endonuclease sites suitable for 
cloning of genes into this vector. Thus, the pKSlSHH piasmid vector allows the 
use of Hygromycin B for selection in both K coli and plants. Confirmation of 
insertion and orientation of the Fl I fragment was accomplished by digestion with 
Hindlll. A clone with the Fl 1 fragment in clockwise orientation was selected and 

20 labeled pJol (Fig. 4). 

Transformation of Somatic Embryo Cultures 
The following stock solutions and media were used for transformation and 
propogation of soybean somatic embryos; 



Stock Solutions 
MS Sulfate lOOx stock 
MgS0 4 7H 2 0 
MnS0 4 H 2 0 
ZnS0 4 7H 2 0 
CuS0 4 5H 2 0 

MS Halides lOOx stock 
CaCl 2 2H 2 0 
KI 

CoCl 2 6H 2 0 

KH 2 P0 4 

H3BO3 



Na 2 Mo0 4 



2H 2 0 



Media 

(g/U SB55 fper Literi 

37.0 1 0 mL of each MS stock 

1 .69 1 mL of B5 Vitamin stock 

0.86 0.8 g NH 4 N0 3 

0,0025 3.033 g KNO3 

1 mL2,4-D(10mg/mL 
stock) 

44.0 0.667 g asparagine 

0.083 pH 5.7 

0.00125 

17.0 SB 103 fper Litert 

0.62 1 pk, Murashige & Skoog salt 

mixture (Gibco BRL) 

0.025 60 g maltose 
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Na 2 EDTA 
FeS0 4 



7H^O 



2.784 



3.724 



2 g gelrite 
pH5.7 

(For SB 103 plus charcoal, 
add 5 g charcoal) 



B5 Vitamin stock 
myoinositol 



SB 148 (per Liter) 



nicotinic acid 
pyridoxine HC1 
thiamine 



100.0 



i.0 



1 pk. Murashige & Skoog 
salt mixture (Gibco BRL) 

60 g maltose 



1.0 



1 mL B5 vitamin stock 



10.0 



7 g agarose 
P H 5.7 



10 



15 



20 



Soybean embryonic suspension cultures were maintained in 35 mL liquid 
media (SB55) on a rotary shaker (150 rpm) at 28°C with a mix of fluorescent and 
incandescent lights providing a 16/8 h day/night schedule. Cultures were 
subcultured every 2 to 3 weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid media* 

Soybean embryonic suspension cultures were transformed with pJol by 
the method of particle gun bombardment (see Klein et al. (1987) Nature 527:70). 
A DuPont Biolistic™ PDSIOOO/He instrument was used for these transformations. 

Five yiL of pJol plasmid DNA (1 \xg/ixL\ 50 ^L CaCl 2 (2.5 M)> and 20 nL 
spermidine (0.1 M) were added to 50 ^L of a 60 mg/mL 1 mm gold particle 
suspension. The particle preparation was agitated for 3 minutes, spun in a 
microfuge for 10 seconds and the supernatant removed. The DNA-coated 
particles were then washed once with 400 fiL 70% ethanol and resuspended in 
40 ixh of anhydrous ethanoL The DN A/particle suspension was sonicated three 
times for 1 second each. Five jxL of the DNA-coated gold particles were then 
loaded on each macro carrier disk. 

Approximately 300 to 400 mg of two week old suspension culture was 
placed in an empty 60 mm x 15 mm petri dish and the residual liquid removed 
from the tissue by pipette. The tissue was placed about 3.5 inches away from the 
retaining screen and bombarded twice. Membrane rupture pressure was set at 
1 000 psi and the chamber was evacuated to -28 inches of Hg. Two plates were 
bombarded per construct per experiment Following bombardment, the tissue was 
divided in half and placed back into liquid media and cultured as described above. 

Fifteen days after bombardment, the liquid media was exchanged with 
fresh SB55 containing 50 mg/mL hygromycin. The selective media was refreshed 
weekly. Six weeks after bombardment, green, transformed tissue was isolated and 
inoculated into flasks to generate new transformed embryonic suspension cultures. 
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Transformed embryonic clusters were removed from liquid culture media 
and placed on a solid agar media, SB103, plus 0.5% charcoal to begin maturation. 
After I week, embryos were transferred to SB 103 media minus charcoal. After 
3 weeks on SB 103 media, maturing embryos were separated and placed onto 
5 SB 1 48 media. Conditions during embryo maturation were 26° C, with a mix of 
fluorescnt and incandescent lights providing a 16/8 h day/night schedule. After 
6 weeks on SB 148 media, embryos were analyzed for the expression of the 
p-conglycinin subunit proteins. Each embryonic cluster gave rise to 5 to 
20 somatic embryos. 

10 Analysis of Transformed Somatic Embryos 

Initial experiments were performed to determine when the a, a' and p 
subunits of p-conglycinin could be visualized during somatic embryo maturation 
by SDS-PAGE gel electrophoresis. Cotyledons of non-transformed embryos 
(generated as above, except they did not undergo bombardment) were dissected 

15 from embryos at 6, 8, 10, and 12 weeks after initiating maturation and kept frozen 
at -80°C until analyzed. Cotyledonary tissue was weighed, 10 fj.L/mg tissue of 
extraction buffer was added, and the tissue ground in a Pellet Pestle Disposable 
Mixer (Kimble/Kontes), Extraction buffer consisted of 50 mM Tris-HCl (pH 7.5), 
10 mM p-mercaptoethanol (BME), and 0.1% SDS. The samples were then 

20 microfuged at 12,000 rpm for 10 minutes and supernatant remove to a new 
microfuge tube by pipette. Extracts were kept frozen at -20° C until used. 

For SDS-PAGE analysis, 8 of (2x) loading buffer was added to 8 piL of 
sample extract. The (2x) loading buffer consisted of 100 mM Tris-HCl (pH 7.5), 
4% SDS, 0.2% bromophenol blue, 1 5% glycerol, and 200 mM PME. The mixture 

25 was heated at 95°C for 4 minutes. Sample mixes were then microfuged 
(12,000 rpm for 20 seconds) and loaded onto a 10% precast Ready Gel™ 
(Bio-Rad) that was assembled into a mini-Protein II Electrophoresis Cell 
(Bio-Rad). Bio-Rad Tris/Glycine/SDS Buffer was used as the running buffer and 
voltage was a constant 125V. In addition to sample extracts, each gel contained 

30 one lane with a molecular weight standard (Bio-rad SDS-PAGE standard, low 
range) and one lane with total soybean seed protein extracted from commercial 
defatted soy flour. Upon completion, the gels were stained with Coomassie 
Brilliant Blue and destained (Maniatis) in order to visualize proteins. Gels were 
photographed, placed in a sealed bag with water, and stored in the refrigerator. 

35 Results indicated that the a* a f and p subunits of p-conglycinin were detectable in 
the cotyledons of somatic embryos between 8 and 10 weeks after the start of 
maturation. 

Analysis of transformed embryos was carried out at 1 0 weeks after the 
start of maturation using the methods described above. Two embryos per clone 
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were analyzed initially. Additional embryos were analyzed if suppression of the 
p-conglycinin subunits was observed in the two embryos. Table 1 presents the 
results of this analysis, wherein the presence or absence of each p-conglycinin 
subunit is indicated by a (+) or (-), respectively. 

5 

TABLE 1 



Clone 


Embryo 


cc 


a' 


B 


Jol-1 


1 


- 


- 






z 










3 


+ 








4 






+ 




5 




+ 




Jo 1-2 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jo 1-3 


1 


+ 




+ 




2 




+ 


+ 


Jo 1-4 


1 










2 










3 










4 




4- 






5 


- 


- 




Jol-5b 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jol-5c 


1 


- 


- 






2 


- 


- 


+ 


Jol-5d 


1 


+ 








2 


+ 






Jol-6a 


1 


- 


- 






2 


- 


- 






3 










4 










5 


+ 


+ 


+ 


Jol-6b 


1 




+ 


+ 




2 


+ 


+ 




Jol-6c 


1 




+ 


+ 


Jol-6d 


1 


+ 


+ 


+ 




2 




+ 


+ 


Jol-6d 


1 


+ 


+ 






2 


+ 


+ 




Jol-6e 


1 


+ 


+ 






2 


+ 


+ 




Jol-7a 


1 






+ 




2 


+ 


+ 




Jol-7b 


1 








Jol-8a 


1 


+ 






Jol-8b 


1 




+ 


+ 
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Jo! -9a 

Jo I -9b 

Jot -9c 
Jol-10 

Seven transgenic clones gave rise to embryos in which the expression a 
and a' was suppressed. In addition, one clone (Jo 1-4) gave rise to embryos in 
which all three P-conglycinin subunits were suppressed- This result is surprising 
5 as the truncated ce transgene sequence overlaps with only a 0.75 kb portion of the 
total 1 32 kb p subunit cDNA. Overall, there is only 52% similarity between the 
truncated a transgene and the p subunit cDNA. With the knowledge at hand, the 
truncated a transgene would not be considered to possess sufficient similarity of 
stucture to "cosuppress" the p subunit of the p-conglycinin gene. 

10 An example of an SDS-PAGE analysis is shown in Figure 5. Lanes 1-3 

are extracts of three cotyledons dissected from embryos generated from clone 
Jol-L Lanes 4 and 5 are protein molecular weight standards and soy protein 
standard derived from seed, respectively. Lanes 6-8 are extracts of cotyledons 
dissected from embryos generated from clone Jo 1-4. The protein pattern in lane 2 

15 is an example of embryos in which both a and <x r are co-suppressed. The protein 
patterns in lanes 6 and 8 are examples of embryos where all the subunits 
comprising p-conglycinin are suppressed. 

EXAMPLE 2 

To determine if expression of p-conglycinin could be suppressed in 
20 developing cotyledons by cosuppression using the p-conglycinin promoter region, 
a piasmid* designated pBS43, containing a Glycine max microsomal delta- 12 
desaturase cDNA (GmFad 2-1) sequence (Heppard et ah, (1996) Plant Physiol. 
110:31 1-319; GenBank Acc. No. L43920) under control of the soybean 
p-conglycinin promoter (Beachy et al., (1985) EMBOJ. 4:3047-3053), was 
25 constructed. The construction of this vector was facilitated by the use of the 
following plasmids: pMH40, pCST2 and pBS13. The piasmid constructions 
detailed below are described in part in United States Patent Application No. 
USSN 08/262,401 and World Patent Publication No. W094/1 1516, both of which 
are incorporated herein by reference. 
30 The pMH40 vector was derived from piasmid pGEM9z, a commercially 

available cloning vector (Promega Biotech) by the insertion a 1.4 kb 35S promoter 
region from CaMV (Odell et al. (1985) Nature J03:810-812; Harpster et al. (1988) 
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Mol Gen. GeneL 212: 182-190) coupled to the p-glucuronidase gene from £ colL 
This was a 1 .85 kb fragment encoding the enzyme p-glucuronidase (Jefferson et 
at. (1986) PNAS USA £3:8447-845 1 ) and a 0.3 kb DNA fragment containing the 
transcription terminator from the nopaiine synthase gene of the Ti-plasmid of 
5 Agrobacterium tumefaciens (Fraley et ah (1 983) PNAS USA 50:4803-4807). 

The vector pCST2 was derived from vectors pML 1 8 and pCWI 09A. The 
plasmid pCW109A contains the soybean p-conglycinin promoter sequence and 
the phaseolin 3 r untranslated region and is a modified version of vector pCW109 
which was derived from the commercially available plasmid pUC18 

1 0 (Gibco-BRL). The vector pC Wl 09 was made by inserting into the Hind III site of 
the cloning vector pUCl 8 a 555 bp 5' non-coding region (containing the promoter 
region) of the p-conglycinin gene followed by the multiple cloning sequence 
containing the restriction endonuclease sites for Nco I, Sma I, Kpn I and Xba I, 
then 1 1 74 bp of the common bean phaseolin 3' untranslated region into the 

1 5 Hind III site. The p-conglycinin promoter region used is an allele of the published 
P-conglycinin gene (Doyle et al, 5 (1986) J. Biol Chem. 267:9228-9238) due to 
differences at 27 nucleotide positions. Further sequence description of this gene 
may be found World Patent Publication W09 1/13993. 

To facilitate use in antisense constructions, the Nco I site and potential 

20 translation start site in the plasmid pCW109 was destroyed by digestion with 

Nco I, mung bean exonuc lease digestion and religation of the blunt site to give the 
modified plasmid pCW109A. 

The vector pML18 consists of the non-tissue specific and constitutive 
cauliflower mosaic virus (35S) promoter (Odell et al., (1985) Nature J/5:810-812; 

25 Hull et aL, (1987) Virology 56:482-493), driving expression of the neomycin 

phosphotransferase gene (Beck et ai. (1982) Gene 7P:327-336) followed by the 3* 
end of the nopaiine synthase gene including nucleotides 848 to 1550 (Depicker et 
ah (1982) J, AppL Genet 7:561-574). This transcriptional unit was inserted into 
the commercial cloning vector pGEM9z (Gibco-BRL) and is flanked at the 5' end 

30 of the 35S promoter by the restriction sites Sal I, Xba I, Bam HI and Sma I, in that 
order. An additional Sal I site is present at the 3 1 end of the NOS 3 1 sequence and 
the Xba I, Bam HI and Sal I sites are unique. The plasmid pML18 was digested 
with Xba I, the singled stranded ends were filled-in using the Klenow fragment of 
DNA polymerase I, and the product was ligated in order to remove the Xba I site. 

3 5 The resulting plasmid was designated pBS 1 6. 

The plasmid pCWI09A was digested with Hind III and the resulting 
1.84 kb fragment, which contained the p-conglycinin/antisense delta- 12 
desaturase cDNA/phaseolin 3' untranslated region, was gel isolated. This L84 kb 
fragment was ligated into the Hind III site of pBS16. A plasmid containing the 
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insert in the desired orientation yielded a 3,53 kb and 4.41 kb fragment when 
digested with Kpn I and this plasmid was designated pCST2, 

The vector pBSl3 was used as the source of the GmFad2-l cDNA, which 
encodes the soybean microsomal dettaI2-desaturase and possesses the sequence as 
5 disclosed in GenBank Acc, No. L43920. The vector pBS 1 3 was derived from the 
vector pML70 (Fig. 1), which contains the KTi3 promoter and the KTi3 3* 
untranslated region and was derived from the commercially available vector 
pTZiSR (Pharmacia) via the intermediate piasmids pML51> pML55, pML64 and 
pML65. A 2.4 kb Bst BI/Eco RI fragment of the complete soybean KTi3 gene 

10 (Jofuku and Goldberg (1989) Plant Cell 1: 1079-1093), which contains al! 
2039 nucleotides of the 5' untranslated region and 390 bases of the coding 
sequence of the KTi3 gene ending at the Eco RI site corresponding to bases 755 to 
761 of the sequence described in Jofuku (supra), was ligated into the Acc I/Eco R] 
sites of pTZl 8R to create the plasmid pML5 1 . To destroy an Nco I site in the 

1 5 middle of the 5' untranslated region of the KTi3 insert, plasmid pML5 1 was cut 
with Nco I, the singled stranded ends were filled-tn using the IClenow fragment of 
DNA polymerase I, and the product was religated resulting in the plasmid pML55. 
The plasmid pML55 was partially digested with Xmn I/Eco RI to release a 
0,42 kb fragment, corresponding to bases 732 to 755 of the above cited sequence, 

20 which was discarded. A synthetic Xmn I/Eco RI linker containing an Nco I site, 
was constructed by making a dimer of complementary synthetic oligonucleotides 
consisting of the coding sequence for an Xmn I site (S'-TCTTCC-S') and an Nco I 
site (S'-CCATGGG^') followed directly by part of an Eco RI site (5'-GAAGG-3 f ). 
The Xmn I and Nco I/Eco RI sites were linked by a short intervening sequence 

25 (5^ATAGCCCCCCAA-3'). This synthetic linker was ligated into the 

Xmn I/Eco RI sites of the 4.94 kb fragment to create the plasmid pML64, The 3' 
untranslated region of the KTi3 gene was amplified from the sequence described 
in Jofuku (supra) by standard PCR protocols (Perkin Elmer Cetus, GeneAmp 
PGR kit) using the primers ML51 and ML52. Primer MLS 1 contained the 

30 20 nucleotides corresponding to bases 1072 to 1091 of the above cited sequence 
with the addition of nucleotides corresponding to Eco RV (S-'GATATCXV), Nco I 
(5^CCATGG-3'), Xba I (5'-TCTAGA-3 , ) > Sma I (5'-CCCGGG-3') and Kpn I 
(5'-GGTACC-3 f ) sites at the 5* end of the primer. Primer ML52 contained to the 
exact compliment of the nucleotides corresponding to bases 1242 to 1259 of the 

35 above cited sequence with the addition of nucleotides corresponding to Sma I 
(5'-CCCGGG-3 r ), Eco RI (5'-GAATTC-3 r ), Bam HI (5'-GGATCC-3) and Sal I 
(5'-GTCGAC-3') sites at the 5 r end of the primer. The PCR-amplifted 3' end of 
the KTi3 gene was ligated into the Nco I/Eco RI sites of pML64 to create the 
plasmid pML65. A synthetic multiple cloning site linker was constructed by 
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making a dimer of complementary synthetic oligonucleotides consisting of the 
coding sequence for Pst I (5'-CTGCA-3'), Sal I (5'-GTCGAC-3% Bam HI 
(5'-GGATCC-3') and Pst I (5^CTGCA^3 r ) sites. The linker was Iigated into the 
Pst I site (directly 5' to the KTi3 promoter region) of pML65 to create the plasmid 
5 pML70. 

The 1 .46 kb Sma I/Kpn I fragment from soybean delta-12 desaturase 
cDNA, GmFad2-l (GenBank Acc. No. L43920), was Iigated into the 
corresponding sites in pML70 resulting in the plasmid pBSIO. The desaturase 
cDNA fragment was in the reverse (antisense) orientation with respect to the KTi3 

10 promoter in pBSlO. The plasmid pBSIO was digested with Bam HI and a 3.47 kb 
fragment, representing the KTi3 promoter/antisense desaturase cDNA/KTi3 3' end 
transcriptional unit was isolated by agarose gel electrophoresis. The vector 
pML18 consists of the non-tissue specific and constitutive cauliflower mosaic 
virus (35S) promoter (Odell et al., (1985) Nature 31 J:810-812; Hull et al., (1987) 

15 Virology 55:482-493), driving expression of the neomycin phosphotransferase 

gene (Beck et al. (1982) Gene J 9:321 -336) followed by the 3' end of the nopaline 
synthase gene including nucleotides 848 to 1550 (Depicker et al. (1982) J. AppL 
Genet. 7:561-574). This transcriptional unit was inserted into the commercial 
cloning vector pGEM9z (Gibco-BRJL) and is flanked at the 5' end of the 35S 

20 promoter by the restriction sites Sal I, Xba I, Bam HI and Sma I in that order. An 
additional Sal I site is present at the 3* end of the NOS 3 r sequence and the Xba I, 
Bam HI and Sal I sites are unique. The 3.47 kb transcriptional unit released from 
pBSIO was Iigated into the Bam HI site of the vector pML18 . When the resulting 
plasmids were digested with Sma I and Kpn I, plasmids containing inserts in the 

25 desired orientation yielded 3 fragments of 5.74, 2.69 and 1 .46 kb. A plasmid with 
the transcriptional unit in the correct orientation was selected and was designated 
pBS13. 

The 1 .46 kb Xbal/EcoRV fragment from pBS 1 3 (described above) was 
directionally cloned into the Srnal/Xbal site of vector pCST2 (described above) to 
30 yield a plasmid designated pBS39. The 33 kb Hindlll fragment of plasmid 

pBS39 was cloned into the Hindlll site of plasmid pMH40 (described above) to 
give the plant expression vector pBS43 (Fig. 6). 

Transformation of soybeans with vector pBS43 and 
identification of a transgenic "TranSwitch" line 
35 The vector pBS43 was transformed into soybean meristems using the 

method of particle bombardment of soybean meristems (Christou et al (1990) 
Trends BiotechnoL 5:145-151). Seeds of transformed plants (i.e., from plants 
which had been identified as positive for GUS activity) were screened for fatty 
acid composition. Fatty acid methyl esters were prepared from hexane extracts of 

23 



BNSDOC1D; <WO 9747731A2_L=* 



WO 97/47731 PCTYUS97/09743 
small (approx. !0 mg) seed chips (Browse et ai (1986) Anal. Biochem. 
152: 1 4 1-145). Seed chips from ten different transgenic lines were analysed and 
some of the Rl seeds from one of these lines, designated 260-05, had a total oleic 
acid content of 80-85% compared with about 20% in control seeds. This 
5 phenotype is caused by the cosuppression of the endogenous Fad 2-1 gene and is 
the result of the insertion of two copies of pBS43 into a locus of the soybean 
genome designated the "Transwitch locus" (Kinney, A. J; (1995) in "Induced 
Mutations and Molecular Techniques for Crop Improvement", International 
Atomic Energy Agency, Vienna). High oleic acid Rl seeds from line 260-05, 

10 which contained the Transwitch locus, were selfed and R2 seeds which were 
homozygous for the Transwitch locus were selected. Two of these R2 
homozygous seeds (G94-1, G94-19) and seeds derived from further generations of 
G94-1 and G94-19 (R3, R4, R5), were selected for further analysis. 

R5 seeds of G94-1 and G94-19 plants grown in both Iowa and Puerto Rico 

15 were ground into a powder and approximately 1 g extracted with 5 mL of hexane. 
After centifugation, the hexane was poured off and the flakes allowed to air dry. 
Approximately 10 mg of defatted powder was extracted as described above and 
analyzed by SDS-PAGE. In both transgenic lines derived from both locations,, the 
expression of the a 1 and a subunits of P-conglycinin were suppressed relative to 

20 control soybean lines and a standard soy flour (Fig, 7), 

EXAMPLE 3 

To test if p-conglycinin expression could be suppressed using antisense 
technology, full length cDNAs of a and a 1 were made using reverse transcriptase 
polymerase chain reaction as described above. The upper primer, ConSa, is 

25 homologous to region 4-19 of both a and a' cDNA sequences with additional 
nucleotides added to the 5 1 end to code for a Kpn I restriction site. The lower 
primer used, Con 1.9a, is homologous to regions 1818-1801 of SEQ ID NO: 1, 
representing the a isoform, and 1920-1903 of SEQ ID NO:2, representing the a* 
isoform, respectively. To aid in subsequent cloning steps, additional nucleotides 

30 were added to the 5* end to code for an Nco I restriction site. 

ConSa S'-ACGGTACCGATGAGAGCGCGGTTCC-S' (SEQ ID NO:8) 
Kpn I 

ConL9a 5 1 -AACCCATGGTCAGTAAAAAGCCCTCAA-3 f (SEQ ID NO:9) 
35 Nco I 

Reverse transcription and subsequent PCR reaction were carried out as 
described above. RN A isolated from developing soybean seeds was reverse- 
transcribed using either random hexamers or ConL9a ( method as detailed above). 
40 The cDNA was amplified in a PCR reaction using ConSa and ConL9a using the 
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protocol detailed above. Fifteen microliters of the PCR reaction mixes were 
analyzed by agarose gel electrophoresis. A 1.8 kb band, the expected molecular 
weight for a, was observed. The remaining reaction mixes were purified using 
Wizard™ PCR Preps DNA Purification System kit (Promega). The a cDNA, 
which because of the primers used included a Kpn I site on the 5' end and an Nco I 
site on the 3* end, was digested with Nco I and Kpn I restriction enzymes. The 
resulting a cDNA was gel isolated, labeled as F10, and directionally cloned 
(antisense orientation) into pCW109 using the Nco I and Kpn I sites present in the 
plasmid. F10 was confirmed as a by PCR using nested primers (upper: Con.09 
(SEQ ID NO:6); lower: Conl.4a (SEQ ID NO:5) and Conl.O (SEQ ID NO: 10)). 

Conl .0 5 f -CGGGTATGGCGAGTGTT-3' (SEQ ID NO: 1 0) 

The transcriptional unit P-conglycinin promoter/a cDNA 
antisense/phaseolin 3* end was released from pCW109/F10 by partial digest with 
Hind III. Conditions of the partial digest were such that 6 fragments were 
produced (5.1 kb, 3.8 kb, 3.6 kb, 2,6 kb, 2.4 kb, and 1 .2 kb). The 3.6 kb fragment 
containing the the transcriptional unit was gel isolated and labeled F14, F14 was 
then cloned into the Hind III site of pKS 1 8HH. After confirming insertion by 
digestion of plasmid DNA preparations made from tansformed cells with Hind III, 
the plasmid DNA from positive cultures was digested with Kpn I to ensure that 
they contained the 3.6 kb F14 fragment and not the 3.8 kb fragment from the 
partial digest of pCW109/F10 with Hind IIL F14 contains a Kpn I site, while the 
3 .8 kb fragment does not. Upon confirmation, pKS 1 8HH/F 1 4 was labeled pJo3 
(Fig. 8), Soybean embryonic suspension cultures were transformed with pJo3 as 
detailed above. Transformation resulted in 5 transformed clones; upon maturation 
each clone gave rise to 4 to 8 somatic embryos. 

Protein extracts of transformed somatic embryos were analyzed by 
SDS-PAGE as previously detailed. Results are presented in Table 2. The 
transgenic clones all gave rise to at least one somatic embryo in which the 
expression of both a and a' was suppressed. 



TABLE 2 



Clone 



Embryo 



a 



a 1 



Jo3-l 



Jo3-2 



1 

2 

1 

2 



Jo3-2b 



1 

2 
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Jo3-3 

Jo3-4 

EXAMPLE 4 

There are five non allelic genes that code for the glycinin subunits. 
Sequencing genomic clones and cDNA's have lead to a division of the subunit 
5 genes into two groups based on sequence similarity. Group I consists of Gy 1 
(SEQ ID NO: 1 1), Gy2 (SEQ ID NO: 12) and Gy3 (SEQ ID NO: 13), whereas 
group II consists of Gy4 (SEQ ID NO: 14) and Gy5 (SEQ ID NO: 15). There is 
greater than 85% similarity between genes within a group, but only 42% to 46% 
similarity between genes of different groups. To determine whether expression of 

10 glycinin can be suppressed in developing cotyledons by employing co-suppression 
technology, cDNA's of Group I and Group II were prepared using reverse 
transcriptase polymerase chain reaction as described above. 

The upper primer used for Group I reactions (Gl-I) is homologus to 
regions 1-19 for all Group I cDNA's. Two lower primers were used: Gl-1039, 

15 which is homologous with regions 1038-1022 of Gyl, 1008-992 of Gy2, and 

996-980 of Gy3; or G I -1475, which is homologus to regions 1475-1460 of Gyl, 
1445-1430 of Gy2 and 1433-1418 of Gy3. To aid in future cloning, all primers 
contained additional nucleotides that coded for a Not I restriction site at their 5' 
end. 

20 

Gl-1 5 1 - GCGGCCGCA TGGCCAAGCTAGTTTTTT-T (SEQIDNO:16) 
Not I 

Gl-1039 S'-GCGGCCGCTGGTGGCGTTTGTGA-S' (SEQIDNO:17) 
Not 1 

25 Gl-1475 S'- GCGGCCGC TCTTCTGAGACTCCTVT (SEQ ID NO:l 8) 

Not I 

RNA isolated from developing soybean seeds was reverse- transcribed 
using either random hexamers, or Gl-1475 or Gl-1039 as the lower primer in the 

30 reactions. cDNA fragments were amplified using a mixture of G 1 - 1 with either 
Gl-1039 or Gl-1475. Fifteen microliters of the PCR reaction mixes were 
analyzed by agarose gel electrophoresis. PCR reactions resulted in products of the 
expected molecular wieght, approximately 1 kb and L4-1 ,5 kb for primer sets 
Gl-l/Gl-1039 and GM/GM475, respectively. cDNA fragments from the 

35 remainder of the reaction mixes were purified using the Wizard™ PCR Preps DNA 
Purification System kit (Promega). Purified cDNA's were then digested with 
Not I and isolated by agarose gel purification. 
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The upper primer used for RT-PCR reactions of Group II (G4-7) is 
homologus to regions 7-22 for both cDNA's of Group II. Two lower primers were 
used: G4-125 1 which is homologus with regions 1251-1234 of Gy4 and 
1 153-1 135 of Gy5; or G4-1670 which is homologus to regions 1668-1653 of Gy4, 
5 There is no similar region in Gy5. To aid in future cloning all primers contained 
additional nucleotides that coded for a Not I restriction site at their 5' end. 

G4-7 5'-GCGGCCGCATGCCCTTCACTCTCT-3 f <3EQ ID NO: 1 9) 
Not I 

10 G4-1 25 1 5'- GCGGCCGCTGGG AGGGTG AGGCTGTT-3 ' (SEQ ID NO:20) 

Not I 

G4-1670 5 -GCGGCCGCTG AGCCTTGTTG AG AC-3 ' (SEQIDNO:21) 
Not! 

1 5 RNA isolated from developing soybean seeds was reverse-transcribed 

using either random hexamers, or G4-1251 or G4-1670 as the lower primer in the 
reactions. cDNA fragments were amplified using a mixture of G4-7 with either 
G4-125 1 or G4-1 670. Fifteen microliters of the PGR reaction mixes were 
analyzed by agarose gel electrophoresis. PCR reactions resulted in products of the 

20 expected molecular weight, approximately 1 .25 kb and 1 .7 kb for primer sets 
G4-7/G4-1251 and G4-7/G4- 16.70, respectively. cDNA fragments from the 
remainder of the reaction mixes were purified using the Wizard™ PCR Preps DNA 
Purification System kit (Promega). Purified cDNA's were then digested with 
Not I and isolated from gels* 

25 The isolated group I cDNAs are cloned into pRB20 (Fig. 9) at the Not I 

site (sense oritentation). After partial restriction digest with Not I and isolation of 
the single cut pRB20/group I linear fragments, group II cDNA are added to create 
final transcriptional units p-conglycinin promoter/group I cDN A (sense 
orientation)/phaseolin 3' end and (i-congiycinin promoter/group II cDNA (sense 

30 orientation)/phaseolin 3' end. The resulting plasmids are then used to transform 
somatic embryonic suspension cultures using the method detailed above. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: E. L DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E} COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 19898 

(G) TELEPHONE: 302-992-5481 

(H) TELEFAX: 302-773-0164 

(I) TELEX: 6717325 

<ii) TITLE OF INVENTION: SUPPRESSION OF SPECIFIC 

CLASSES OF SOYBEAN SEED 
PROTEIN GENES 

(iii) NUMBER OF SEQUENCES: 21 

(iv} COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

. (C) OPERATING SYSTEM: MICROSOFT WORD FOR WINDOWS 95 
(D) SOFTWARE: MICROFSOFT WORD 7.0 

(V) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/019,940 

(B) FILING DATE: JUNE 14,1996 

(vii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: LYNNE M. CHRISTENBURY 

(B) REGISTRATION NUMBER: 30,971 

(C) REFERENCE /DOCKET NUMBER: BB-1071 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1818 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO;l: 

ATGATGAGAG CACGGTTCCC ATTACTGTTG CTGGGACTTG TTTTCCTGGC TTCAGTTTCT 60 

GTCTCATTTG GCATTGCTTA CTGGGAAAAA GAGAACCCCA AACACAACAA GTGTCTCCAG 120 

AGTTGCAATA GCGAGAGAGA CTCGTACAGG AACCAAG CAT GCCACGCTCG TTGCAACCTC 180 

CTTAAGGTGG AGAAAGAAGA ATGTGAAGAA GGTGAAATTC CACGACCACG ACCACGACCA 24 0 

CAACACCCGG AGAGGGAACC TCAGCAACCC GGTGAGAAGG AGGAAGACGA AGATGAGCAA 300 

CCACGTCCAA TCCCATTCCC ACGCCCACAA CCTCGTCAAG AAGAAGAGCA CGAGCAGAGA 3 60 

GAGGAACAGG AATGGCCTCG CAAGGAGGAA AAACGCGGAG AAAAGGGAAG TGAAGAGGAA 4 20 

GATGAGGATG AGG AT G AGGA ACAAGATGAA CGTCAATTCC CATTCCCACG CCCACCTCAT 4 80 

CAGAAGGAAG AGCGAAACGA AG AG G AAG AT GAGGATGAGG AGCAGCAGCG AGAGAGCGAA 54 0 

GAAAGTGAAG ATTCTGAGTT ACGAAGACAT AAGAATAAGA ACCCTTTTCT CTTCGGCTCT 600 

AACAGGTTCG AAACTCTCTT CAAAAACCAA TATGGTCGCA TTCGCGTCCT CCAGAGGTTC 660 

AACCAACGCT CCCCACAACT TCAGAATCTC CGAGACTACC GCATTTTGGA GTTCAACTCC 720 

AAACCCAACA CCCTCCTTCT CCCCAACCAT GCTGACGCTG ATTACCTCAT CGTTATCCTT 7 80 

AACGGGACTG CCATTCTTTC CTTGGTGAAC AACGACGACA GAGACTCCTA CAGACTTCAA 840 

TCTGGTGATG CCCTGAGAGT CCCCTCAGGA ACCACAT AC T ATGTGGTCAA CCCTGACAAC 900 

AACGAAAATC TCAGAT TAAT AACACTCGCC ATACCCGTTA ACAAGCCTGG TAGATTTGAG 960 

AGTTTCTTCC TATCTAGCAC TGAAGCTCAA CAATCCTACT TGCAAGGATT CAGCAGGAAC 1020 

ATTTTAGAGG CCTCCTACGA T AC CAAATT C GAGGAGATAA ACAAGGTTCT GTTTAGTAGA 108 0 

GAG G AAGGGC AGCAGCAAGG GGAGCAGAGG CTGCAAGAGA GCGTGATTGT GGAAATCTCG 1140 

AAGGAACAGA TTCGGGCACT GAGCAAACGT GCCAAATCTA GTTCAAGGAA AACCATTTCT 1200 

TCTGAAGATA AACCTTTTAA C T T GAG AAG C CGCGACCCCA TCTACTCCAA CAAGCTTGGC 12 60 

AAGTTCTTTG AGATCACCCC AGAGAAAAAC CCCCAGCTTC GGGACTTGGA TATCTTCCTC 1320 

AGTATTGTGG ATATGAACGA GGGAGCTCTT CTTCTACCAC ACTTCAATTC AAAGGCGATA 138 0 
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GTGATACTGG TAATTAATGA AGGAGATGCA AACATTGAAC TTGTTGGCCT AAAAGAACAA 14 40 

CAACAGGAGC AGCAACAGGA AGAGCAACCT TTGGAAGTGC GGAAATATAG AGCCGAATTG 1500 

TCTGAACAAG ATATATTTGT AATCCCAGCA GGTTATCCAG TTGTGGTCAA CGCTACCTCA 1560 

AATCTGAATT TCTTTGCTAT TGGTATTAAT GCCGAGAACA ACCAGAGGAA CTTCCTCGCA 1620 

GGTTCGCAAG ACAATGTGAT AAGCCAGATA CCTAGTCAAG TGCAGGAGCT TGCATTCCCT 1680 

GGGTCTGCAC AAGCTGTTGA GAAGCTATTA AAGAACCAAA GAGAATCCTA CTTTGTGGAT 17 4 0 

GCTCAGCCTA AGAAGAAAGA GGAGGGGAAT AAGGGAAGAA AGGGTCCTTT GTCTTCAATT 18 00 

TTGAGGGCTT TTTACTGA 1818 

(2) INFORMATION FOR 5EQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1920 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGATGAGAG CGCGGTTCCC ATTACTGTTG CTGGGAGTTG TTTTCCTAGC ATCAGTTTCT 60 

GTCTCATTTG GCATTGCGTA TTGGGAAAAG CAGAACCCCA GTCACAACAA GTGCCTCCGA 120 

AG TTGCAATA GCGAGAAAGA CTCCTACAGG AACCAAGCAT GCCACGCTCG TTGCAACCTC 180 

CTTAAGGTGG AG GAAGAAG A AGAATGCGAA GAAGGTCAAA TTCCACGACC AC G AC C AC AA 24 0 

CACCCGGAGA GGGAAC G TC A GCAACACGGT GAGAAGGAGG AAGACGAAGG TGAGCAGCCA 300 

CGTCCATTCC CATTCCCACG CCCACGCCAA CCTCATCAAG AGGAAGAGCA CGAGCAGAAG 3 60 

GAGGAACACG AATGGCATCG CAAGGAGGAA AAACACGGAG GAAAGGGAAG TGAAGAGGAA 4 20 

CAAGATGAAC GTGAACACCC ACGCCCACAC CAACCTCATC AAAAG G AAG A G G AAAAG C AC 4 80 

GAATGGCAAC ACAAGCAGGA AAAGCACCAA GG AAAG G AAA GTGAAGAAGA AGAAGAAGAC 54 0 

CAAGACGAGG ATGAGGAGCA AGACAAAGAG AGCCAAGAAA GTGAAGGTTC TGAGTCTCAA 600 

AGAGAACCAC GAAGACATAA GAATAAGAAC CCTTTTCACT TCAACTCTAA AAGGTTCCAA 660 

ACTCTCTTCA AAAACCAATA TGGCCACGTT CGCGTCCTCC AGAGGTTCAA CAAACGCTCC 720 

CAACAGCTTC AGAATCTCCG AGAC TACCGC ATTTTGGAGT TCAACTCCAA ACCCAACACC 780 

CTTCTTCTCC CCCACCATGC TGACGCTGAT TACCTCATCG TTATCCTTAA CGGGACTGCC 84 0 

ATTCTTACCT TGGTGAACAA CGACGACCGA GACTCTTACA ACCTTCAATC TGGCGATGCC 900 
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CTAAGAGTCC CTGCAGGAAC CACATTCTAT GTGGTTAACC CTGACAACGA CGAGAATCTC 960 

AGAA T GAT AG CAGGAACCAC ATTCTATGTG GTTAACCCTG ACAACGACGA GAATCTCAGA 1020 

ATGATAACAC TCGCCATACC CGTTAACAAA CCCGGTAGAT TTGAGAGTTT CTTCCTATCT 108 0 

AGCACTCAAG CTCAACAGTC CTACTTGCAA GGGTTCAGCA AGAATATTCT AGAGGCCTCA 114 0 

TACGACACCA AATTCGAGGA GATAAACAAG GTTCTGTTTG GTAGAGAGGA GGGGCAGCAA 12 00 

CAAGGGGAGG AGAGGCTGCA AGAGAGTGTG ATTGTGGAAA TCTCAAAGAA ACAAATTCGG 1260 

GAACTGAGCA AACATGCCAA ATCTAGTTCA AGGAAAACCA TTTCTTCTGA AGATAAACCT 132 0 

TTCAACTTGG GAAGCCGCGA CCCCATCTAT TCCAACAAGC TTGGCAAGTT GTTTGAGATT 138 0 

ACCCAGAGAA ACCCTCAGCT TCGGGACTTG GATGTCTTCC TCAGTGTTGT GGATATGAAC 14 40 

GAGGGAGCTC TTTTTCTACC ACACTTCAAT TCAAAGGCCA TAGTGGTACT AGTGATTAAT 1500 

GAAGGAGAAG CAAACATTGA ACTTGTTGGC ATTAAAGAAC AACAACAGAG GCAGCAACAG 15 60 

GAAGAGCAAC CTTTGGAAGT GCGGAAATAT AGAGCTGAAT TGTCTGAACA AGATATATTT 162 0 

GTAATCCCAG CAGGTTATCC AGTTATGGTC AACGCTACCT CAGATCTGAA TTTCTTTGCT 1680 

TTTGGTATCA ATGCCGAGAA CAACCAGAGG AACTTCCTTG CAGGTTCGAA AGACAATGTG 17 4 0 

ATAAGCCAGA TACCTAGTCA AGTGCAGGAG CTTGCGTTCC CTAGGTCTGC AAAAGATATT 1800 

GAGAACCTAA TAAAGAGCCA AAGTGAGTCC TACTTTGTGG ATGCTCAGCC TCAGCAGAAA 18 60 

GAGGAGGGGA ACAAGGGAAG AAAGGGTCCT TTGTCTTCAA TTTTGAGGGC TTTTTACTGA 1920 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii} MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGATGAGAG TGCGGTTTCC TTTGTTGGTG TTGCTGGGAA CTGTTTTCCT GGCATCAGTT 60 

TGTGTCTCAT TAAAGGTGAG AGAGGATGAG AATAACCCTT TCTACTTTAG AAGC7CTAAC 120 

AGCTTCCAAA CTCTCTTTGA GAACCAAAAC GTTCGCATTC GTCTCCTCCA GAGATTCAAC 18 0 

AAACGCTCCC CACAACTTGA GAACCTTCGA GACTACCGGA TTGTCCAGTT TCAGTCAAAA 24 0 

CCCAACACAA TCCTTCTCCC CCACCATGCT GACGCCGATT TCCTCCTCTT TGTCCTTAGC 30 0 

GGGAGAGCCA TACTTACCTT GGTGAACAAC GACGACAGAG ACTCCTACAA CCTTCACCCT 3 60 

31 



BNSDOCID; <WO 9747731 A2_l_> 



WO 97/47731 PCT/OS97/09743 

GGCGATGCCC AGAGAATCCC AGCTGGAACC ACT TACT ATT TGGTTAACCC TCACGACCAC 4 20 

CAGAATCTCA AAATAATCAA ACTTGCCATA CCCGTCAACA AACCTGGCAG ATATGATGAT 4 80 

TTCTTCTTAT CTAGCACTCA AGCCCAACAG TCCTACTTGC AAGGCTTCAG CCATAATATT 54 0 

CTAGAGACCT CCTTCCATAG CGAATTCGAG GAG AT AAAC A GGGTTTTGTT TGGAGAGGAA 600 

GAGGAGCAGA GGCAGCAAGA GGGAGTGATC GTGGAACTCT CAAAGGAACA AATTCGGCAA 660 

CTGAGCAGAC GTGCCAAATC TAGTTCAAGG AAAACCATTT CCTCCGAAGA TGAACCATTC 720 

AACTTGAGAA GCCGCAACCC CATC T ATT CC AACAACTTTG GAAAGTTCTT TGAGATCACC 7 80 

CCTGAGAAAA ACCCACAGCT TCGGGACTTG GATATCTTCC TCAGTTCTGT GGATATCAAC 84 0 

GAAGGAGCTC TTCTTCTACC ACACTTCAAT TCAAAGGCCA TAGTGATACT AGTGATTAAT 900 

GAAGGAGATG C AAAC ATT G A ACTTGTTGGC AT T AAAGAAC AACAACAGAA GCAGAAACAG 960 

GAAGAGGAAC CTTTGGAAGT GCAAAGGTAC AGAGCTGAAT TGTCTGAAGA CGATGTATTT 1020 

GTAATTCCAG CAGCTTATCC ATTTGTCGTC AACGCTACCT CAAACCTCAA TTTCCTTGCT 1080 

TTTGGTATCA ATGCTGAGAA CAACCAGAGG AACTTCCTTG CAGGCGAGAA AGACAATGTG 114 0 

GTAAGGCAGA TAGAAAGACA AGTGCAGGAG CTTGCGTTCC CTGGGTCTGC ACAAGATGTT 1200 

GAGAGGCTAT TAAAGAAGCA GAGGGAATCC TACTTTGTTG ATGCTCAGCC TCAGCAGAAG 1260 

GAGGAGGGGA GTAAGGGAAG AAAGGGTCCT TTTCCTTCAA TCTTAGGTGC TCTCTACTGA 1320 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGTACCATGG TGAGAGCGCG GTTCC 25 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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CGGTACCGAA TTGAAGTGTG GTAG 



24 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 27 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCGTCCATGG AGCGCGGTTC CCATTAC 27 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCTCGGTCGT CGTTGTT 17 
(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



ACGGTACCGA TGAGAGCGCG GTTCC 



25 



(2) 



INFORMATION FOR SEQ ID NO: 9: 
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Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AACCCATGGT CAGTAAAAAG CCCTCAA 
(2) r N FORMAT I ON FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE; nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: DNA {genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 10: 

CGGGTATGGC GAGTGTT 

(2) INFORMATION FOR SEQ ID NO: 11: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

Cxi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 
ATGGC CAAGC TAGTTTTTTC CCTTTGTTTT CTGCTTTTCA GTGGCTGCTG CTTCGCTTTC 
AGTTCCAGAG AGCAGCCTCA G C AAAACG AG TGCCAGATCC AAAAACTCAA TGCCCTCAAA 
CCGGATAACC GTATAGAGTC AGAAGGAGGG CTCATTGAGA CATGGAACCC TAACAACAAG 
CCATTCCAGT GTGCCGGTGT TGCCCTCTCT CGCTGCACCC TCAACCGCAA CGCCCTTCGT 
AGACCTTCCT ACACCAACGG TCCCCAGGAA ATCTACATCC AACAAGGTAA GGGTATTTTT 
GGCATGATAT ACCCGGGTTG TCCTAGCACA TTTGAAGAGC CTCAACAACC TCAACAAAGA 
G GAC AAAGC A GCAGACCACA AGACCGTCAC CAGAAGATCT ATAACTTCAG AGAGGGTGAT 
TTGATCGCAG TGCCTACTGG TGTTGCATGG TGGATGTACA ACAATGAAGA CACTCCTGTT 
GTTGCCGTTT CTATTATTGA CACCAACAGC TTGGAGAACC AGCTCGACCA GATGCCTAGG 
AGATTCTATC TTGCTGGGAA CCAAGAGCAA GAGTTTCTAA AATATCAGCA AGAGCAAGGA 
GGTCATCAAA GCCAGAAAGG AAAGCATCAG CAAGAAGAAG AAAACGAAGG AGGCAGCATA 
TTGAGTGGCT TCACCCTGGA ATTCTTGGAA CATGCATTCA GCGTGGACAA GCAGATAGCG 
AAAAAC CT AC AAGGAGAGAA CGAAGGGGAA GACAAGGGAG CCATTGTGAC AGT GAAAGG A 
GGTCTGAGCG TGATAAAACC ACCCACGGAC GAGCAGCAAC AAAGACCC C A GGAAGAGGAA 
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GAAGAAGAAG AGGATGAGAA GCCACAGTGC AAGGGTAAAG ACAAACACTG 
CGAGGAAGCC AAAGCAAAAG CAGAAGAAAT GGCATTGACG AGACCATATG 
CTTCGCCACA ACATTGGCCA G ACT TC ATC A CCTGACATCT ACAACCCTCA 
GTCACAACCG CCACCAGCCT TGACTTCCCA GCCCTCTCGT GGCTCAGACT 
TTTGGATCTC TCCGCAAGAA TGCAATGTTC GTGCCACACT ACAACCTGAA 
ATAATATACG CATTGAATGG ACGGGCATTG ATACAAGTGG TGAATTGCAA 
GTGTTTGATG GAGAGCTGCA AGAGGGACGG GTGCTGATCG TGCCACAAAA 
GCTGCAAGAT CACAGAGTGA CAACTTCGAG TATGTGTCAT TCAAGACCAA 
ATGATCGGCA CTCTTGCAGG GGCAAACTCA TTGTTGAACG CATTACCAGA 
CAGCACACTT TCAACCTAAA AAGCCAGCAG GCCAGGCAGA TAAAGAACAA 
AAGTTCCTGG TTCCACCTCA GGAGTCTCAG AAGAGAGCTG TGGCTTAG 
(2) iNFORt^ATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 145B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 12: 
ATGGCCAAGC TTGTTCTTTC CCTTTGTTTC CTTCTTTTCA GTGGCTGCTT 
GAGCAGGCAC AGCAAAATGA GTGCCAGATC CAAAAGCTGA ATGCCCTCAA 
CGTATAGAGT CGGAAGGTGG GTTCATTGAG ACATGGAACC CTAACAACAA 
TGTGCCGGTG TTGCCCTCTC TCGCTGCACC CTTAACCGCA ATGCCCTTCG 
TACACCAACG GTCCCCAGGA AATCTACATA CAACAAGGTA ATGGTATTTT 
TTCCCGGGTT GTCCTAGCAC TTATCAAGAG CCGCAAGAAT CTCAGCAACG 
CAGAGGCCCC AAGACCGTCA CCAAAAGGTA CATCGCTTCA G AG AGGGT G A 
GTGCCTACTG GTGTTGCATG GTGGATGTAC AACAATGAAG ACACTCCTGT 
TCTATTATTG ACACCAACAG CTTGGAGAAC CAGCTCGACC AGATGCCTAG 
CTTGCTGGGA ACCAAGAGCA AGAGTTTCTA AAATATCAGC AGCAGCAGCA 
CAAAGCCAGA AAGGAAAGCA ACAAGAAGAA GAAAACGAAG GAAGCAACAT 
TTCGCCCCTG AATTCTTGAA AGAAGCGTTC GGCGTGAACA TGCAGATAGT 
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CCAACGCCCC 900 

CACCATGAGA 960 

AGCCGGTAGC 102 0 

CAGTGCTGAG 108 0 

CGCGAACAGC 114 0 

CGGTGAGAGA 1200 

CTTTGTGGTG 12 60 

TGATACACCC 132 0 

GGAAGTGATT 1380 

CAACCCTTTC 14 40 



1488 



CGCTCTGAGA 
ACCGGATAAC 
GCCATTCCAG 
TAGACCTTCC 
T G GC AT GAT A 
AGGACGAAGC 
TTTGATCGCA 
TGTTGCCGTT 
GAGATTCTAT 
AGGAGGTTCC 
ATTGAGTGGC 
GAGAAACCTA 
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CAAGGTGAGA ACGAAGAGGA GGATAGTGGA GCCATTGTGA CAGTGAAAGG AGGTCTAAGA 780 

GTCACAGCTC CAGCCATGAG GAAGCCACAG CAAGAAGAAG ATGATGATGA TGAGGAAGAG 8 40 

CAGCCACAGT GCGTGGAGAC AG ACAAAG GT TGCCAACGCC AAAGCAAAAG GAG C AG AAAT 900 

GGCATTGATG AGACCATTTG CACAATGAGA CTTCGCCAAA ACATTGGTCA GAATTCATCA 960 

CCTGACATCT ACAACCCTCA AGCTGGTAGC ATCACAACCG CCACCAGCCT TGACTTCCCA 1020 

GCCCTCTGGC TTCTCAAACT CAGTGCCCAG TATGGATCAC TCCGCAAGAA TGCTATGTTC 108 0 

GTGCCACACT ACACCCTGAA CGCGAACAGC ATAATATACG CATTGAATGG GCGGGCATTG 114 0 

GTACAAGTGG TGAATTGCAA TGGTGAGAGA GTGTTTGATG GAGAGCTGCA AGAGGGAGGG 1200 

GTGCTGATCG TTCCACAAAA CTTTGCGGTG GCTGCAAAAT CCCAGAGCGA TAACTTTGAG 12 60 

TATGTGTCAT TCAAGACCAA TGATAGACCC TCGAT CGGAA ACCTTGCAGG GGC AAACT C A 1320 

TTGTTGAACG CATTGCCAGA GGAAGTGATT CAGCACACTT TTAACCTAAA GAGCCAGCAG 1380 

GCCAGGCAGG TGAAGAACAA CAACCCTTTC AGCTTCCTTG TTCCACCTCA GGAGTCTCAG 14 4 0 

AGGAGAGCTG TGGCTTAG 14 58 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AT GGCTAAG C TTGTTCTTTC CCTTTGTTTT CTGCTTTTCA GTGGCTGCTG CTTCGCTTTC 6 0 

AGTTTCAGAG AGCAGCCACA G C AAAAC GAG TGCCAGATCC AACGCCTCAA TGCCCTAAAA 120 

CCGGATAACC GTATAGAGTC AGAAGGTGGC TTCATTGAGA CATGGAACCC TAACAACAAG 180 

CCATTCCAGT GTGCCGGTGT TGCCCTCTCT CGCTGCACCC TCAACCGCAA CGCCCTTCGC 24 0 

AGACCTTCCT ACACCAACGC TCCCCAGGAG ATCTACATCC AACAAGGTAG TGGTATTTTT 300 

GGCAT GAT AT TCCCGGGTTG TCCTAGCACA TTTGAAGAGC CTCAACAAAA AGGACAAAGC 360 

AGCAGGCCCC AAGACCGTCA CCAGAAGATC TATCACTTCA GAGAGGGTGA TTTGATTGCA 4 20 

GTGCCAACCG GTTTTGCATA CTGGATGTAC AACAATGAAG ACACTCCTGT TGTTGCCGTT 4 80 

TCTCTTATTG ACACCAACAG CTTCCAGAAC CAGCTCGACC AGATGCCTAG GAGATTCTAT 54 0 

CTTGCTGGGA ACCAAGAGCA AGAGTTTCTA CAGTATCAGC CACAGAAGCA GCAAGGAGGT 600 
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ACTCAAAGCC AGAAAGGAAA GCGTCAGCAA GAAGAAGAAA ACGAAGGAGG CAGCATATTG 660 

AGTGGCTTCG CCCCGGAATT CTTGGAACAT GCGTTCGTCG TGGACAGGCA GATAGTGAGA 7 20 

AAGCTACAAG GTGAGAACGA AGAGGAAGAG AAGGGTGCCA TTGTGACAGT GAAAGGAGGT 7 80 

CTCAGCGTGA TAAGCCCACC CACGGAAGAG CAGCAACAAA GACCCGAGGA AGAGGAGAAG 8 40 

CCAGATTGTG ACGAGAAAGA CAAACATTGC CAAAGCCAAA GCAGAAATGG CAT TGACGAG 900 

ACCATTTGCA CAATGAGACT TCGCCACAAC ATTGGCCAGA CTTCATCACC TGACATCTTC 960 

AACCCTCAAG CTGGTAGCAT CACAACCGCT ACCAGCCTCG ACTTCCCAGC CCTCTCGTGG 102 0 

CTCAAACTCA GTGCCCAGTT TGGATCACTC CGCAAGAATG CTATGTTCGT GCCACACTAC 1080 

AACCTGAACG CAAACAGCAT AATATACGCA TTGAATGGAC GGGCATTGGT ACAAGTGGTG 114 0 

AATTGCAATG GTGAGAGAGT GTTTGATGGA GAGCTGCAAG AGGGACAGGT GTTAATTGTG 1200 

CCACAAAACT TTGCGGTGGC TGCAAGATCA C AG AG CG ACA ACTTCGAGTA TGTTTCATTC 1260 

AAGACCAATG A TAG AC CCTC GATCGGCAAC CTTGCAGGTG CAAACTCATT GTTGAACGCA 1320 

TTGCCGGAGG AAGTGATTCA GCAAACTTTT AACCTAAGGA GGCAGCAGGC CAGGCAGGTC 1380 

AAGAACAACA ACCCTTTCAG CTTCCTGGTT CCACCTAAGG AGTCTCAGAG GAGAGTTGTG 14 4 0 

GCTTAG 14 4 6 
(2) INFORMATION FOR SEQ ID NO: 14: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 168 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 14 : 

ATGGGGAAGC CCTTCACTCT CTCTCTTTCT TCCCTTTGCT TGCTACTCTT GTCGAGTGCA 60 

TGCTTTGCTA TTAGCTCCAG CAAGCTCAAC GAGTGCCAAC TCAACAACCT CAACGCGTTG 120 

GAACCCGACC ACCGCGTTGA GTTCGAAGGT GGTTTGATTC AAACATGGAA CTCTCAACAC 180 

CCTGAGCTGA AATGCGCCGG TGTCACTGTT TCCAAACTCA CCCTCAACCG CAATGGCCTC 240 

CACTTGCCAT CTTACTCACC TTATCCCCGG ATGATCATCA TCGCCCAAGG GAAAGGAGCA 300 

CTGCAGTGCA AGCCAGGATG TCCTGAGACG TTTGAGGAGC CACAAGAACA ATCAAACAGA 360 

AGAGGCTCAA GGTCGCAGAA GCAGCAGCTA CAGGACAGTC ACCAGAAGAT TCGTCACTTC 4 20 

AATGAAGGAG ACGTACTCGT GATTCCTCCT GGTGTTCCTT ACTGGACCTA TAACACTGGC 4 80 
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GATGAACCAG TTGTTGCCAT CAGTCTTCTT GACACCTCTA ACTTCAATAA CCAGCTTGAT 54 0 

CAAACCCCTA GGGTATTTTA CCTTGCTGGG AACCCAGATA TAGAGTACCC AGAGACCATG 600 
CAACAACAAC AACAGCAGAA AAGTCATGGT GGACGCAAGC AGGGGCAACA CCAGCAGGAG 660 
GAAGAGGAAG AAGGTGGCAG CGTGCTCAGT GGCTTCAGCA AACACTTCTT GGCACAATCC 720 
TTCAACACCA ACGAGGACAT AGCTGAGAAA CTTCAGTCTC CAGACGACGA AAGGAAGCAG 78 0 
ATCGTGACAG TGGAAGGAGG TCTCAGCGTT ATCAGCCCCA AGTGGCAAGA ACAACAAGAT 840 
GAAGATGAAG ATGAAGACGA AGATGATGAA GATGAACAAA TTCCCTCTCA CCCTCCTCGC 900 
CGACCAAGCC ATGGAAAGCG TGAACAAGAC GAGGACGAGG ACGAAGATGA AG AT AAACC T 960 

CGTCCTAGTC GACCAAGCCA AGGAAAGCGT GAACAAGACC AGGACCAGGA CGAGGACGAA 1020 

GATGAAGATG AAGATCAACC TCGCAAGAGC CGCGAATGGA GATCGAAAAA GACACAACCC 1080 

AGAAGACCTA GACAAGAAGA ACCACGTGAA AGAGGATGCG AGACAAGAAA CGGGGTTGAG 114 0 

GAAAATATCT GCACCTTGAA GCTTCACGAG AACATTGCTC GCCCTTCACG CGCTGACTTC 1200 

TACAACCCTA AAGCTGGTCG CAT TAG T ACC CTCAACAGCC TCACCCTCCC AGCCCTCCGC 1260 

CAATTCCAAC TCAGTGCCCA ATATGTTGTC CTCTACAAGA ATGGAATTTA CTCTCCACAT 1320 

TG.GAATC TGA ATGCAAACAG TGTGATCTAT GTGACTCGAG GACAAGGAAA GGTTAGAGTT 1380 

GTGAACTGCC AAGGGAATGC AGTGTTCGAC GGTGAGCTTA GGAGGGGACA ATTGCTGGTG 14 4 0 

GTACCACAGA ACTTCGTGGT GGCGGAGCAA GCCGGAGAAC AAGGATTCGA ATACATAGTA 15 00 

TTCAAGACAC ACCACAACGC AGTCACTAGC TACTTGAAGG ATGTGTTTAG GGCAATTCCC 15 60 

TCAGAGGTTC TTGCCCATTC TTACAACCTT CGACAGAGTC AAGTGTCTGA GCTTAAGTAT 162 0 

GAAGGAAATT GGGGTCCTTT GGTCAACCCT GAGTCTCAAC AAGGCTCACC CCGTGTTAAA 16 80 
GTCGCATAA 

(2) INFORMATION FOR SEQ ID NO: 15: 



1689 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGGGGAAGC CCTTCTTCAC TCTCTCTCTT TCTTCCCTTT GCTTGCTACT CTTGTCGAGT 60 
GCATGCTTTG CTATTACCTC CAGCAAGTTC AACGAGTGCC AACTCAACAA CCTCAACGCG 120 
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TTGGAACCCG ACCACCGCGT TGAGTCCGAA GGTGGTCTTA TTGAAACATG GAACTCTCAA 18 0 

CACCCTGAGC TGCAATGCGC CGGTGTCACT GTTTCCAAAC GCACCCTCAA CCGCAACGGC 24 0 

TCCCACTTGC CATCTTACTT ACCTTATCCC CAAATGATCA TTGTCGTTCA AGGGAAGGGA 300 

GCAATTGGAT TTGCATTTCC GGGATGTCCC GAGACGTTTG AGAAGCCACA ACAACAATCA 360 

AGCAGAAGAG GCTCAAGGTC ACAGCAGCAA CTACAAGACA GTCACCAGAA GATTCGTCAC 42 0 

TTCAATGAAG GAGACGTACT AGTGATTCCT CTTGGTGTTC CTTACTGGAC CTATAACACT 4 80 

GGCGATGAAC CAGTTGTTGC CATCAGTCCT CTTGACACCT CCAACTTCAA CAATCAGCTT 54 0 

GATCAAAACC CCAGAGTATT TTACCTTGCT GGGAACCCAG ATATAGAGCA TCCCGAGACC 600 

ATGCAACAAC AGCAGCAGCA GAAGAGTCAT GGTGGACGCA AGCAGGGGCA ACACCGACAG 6 60 

CAGGAGGAAG AAGGTGGCAG TGTGCTCAGT GGCTTCAGCA AACATTTCTT AGCACAATCC 720 

TTCAACACCA ACGAGGACAC AGCTGAGAAA CTTCGGTCTC CAGATGACGA AAGGAAGCAG 7 80 

ATCGT G AC AG TGGAGGGAGG CCTCAGCGTT ATCAGCCCCA AGTGGCAAGA ACAAGAAGAC 84 0 

GAAGACGAAG ACGAAGACGA AGAAT AT GGA CGGACGCCCT CTTATCCTCC ACGACGACCA 900 

AGCCATGGAA AGCATGAAGA TGACGAGGAC GAGGACGAAG AAGAAGATCA ACCTCGTCCT 960 

GATCACCCTC CACAGCGACC AAGCAGGCCC GAACAACAAG AACCACGTGG AAGAGGATGT 1020 

CAGACTAGAA ATGGGGTTGA GGAAAATATT TGCACCATGA AGCTTCACGA GAACATTGCT 1080 

CGCCCTTCAC GTGCTGACTT CTACAACCCA AAAGCTGGTC GCATTAGCAC CCTCAACAGT 114 0 

CTCACCCTCC CAGCCCTCCG CCAATTCGGA CTCAGTGCCC AATATGTTGT CCTCTACAGG 12 00 

AATGGAATTT ACTCTCCAGA TTGGAACTTG AACGCGAACA GTGTGACGAT GACTCGAGGG 1260 

AAAGGAAGAG TTAGAGTGGT GAACTGCCAA GGGAATGCAG TGTTCGACGG TGAGCTAAGG 132 0 

AGGGGACAAT TGCTAGTGGT GCCGCAGAAC CCCGCGGTGG CTGAGCAAGG GGGAGAACAA 1380 

GGATTGGAAT ATGTAGTGTT CAAGACACAC CACAACGCCG TGAGCAGCTA CATTAAGGAT 144 0 

GTGTTTAGGG TAATCCCTTC GGAGGTTCTT TCCAATTCTT ACAACCTTGG CCAGAGTCAA 1500 

GTGCGTCAGC TCAAGTATCA AGGAAACTCC GGCCCTTTGG TCAACCCATA A 1551 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
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(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCGGCCGCAT GGCCAAGCTA GTTTTTT 27 
(2) INFORMATION FOR SEQ ID NO: 17: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCGGCCGCTG GTGGCGTTTG TGA 23 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: IB: 
GCGGCCGCTC TTCTGAGACT CCT 23 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGGCCGCAT GCCCTTCACT CTCT 2 4 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GCGGCCGCTG GGAGGGTGAG GCTGTT 26 
(2) INFORMATION FOR SEQ ID NO:22; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY r linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21; 
GCGGCCGCTG AGCCTTGTTG AGAC 2 4 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc 13 

A. The indications made below relate to the microorganism referred lo in the description 

on page 7 , line 1 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 

Name of depositary institution 
AMERICAN TYPE CULTURE COLLECTION 



Address of depositary institution (including postal code and country) 
12301 Parklawn Drive 
Rockville, Maryland 20852 
US 



Dnlc of deposit 

15 June 1996 


Accession Number 

97614 


C, ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q j 



In respect of those designations in which a European patent is sought, 
a sample of the deposited microorganism will be made available until 
the publication of the mention of the grant of the European patent or 
until the date on which the application has been refused or withdrawn 
or is deemed to be withdrawn % only by the issue of such a sample to an 
expert nominated by the person requesting the sample* (Rule 28(4) EPC) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (lea\>e blank if not applicable) 



The indications listed below will be submitted to the international Bureau later (s^yecify the general nature of the indication* e.g.. "Accession 
Number of Deposit") 



For receiving Office use only 



[~| Tins sheet was received with the international application 



Authorized officer 



For International Bureau use only 



j"] This sheet was received by the International bureau on: 



Authorized officer 



Form PCT/RO/ r I34 (July 1992) 
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A- The indications made below relate to the microorganism referred to in the description 
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Authorized officer 



For International Bureau use only 
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CLAIMS 

1 . A method for reducing the quantity of a soybean seed storage protein 
in soybean seeds comprising: 

(a) constructing a chimeric gene comprising: 

5 (i) a nucieic acid fragment encoding a promoter that is 

functional in the cells of soybean seeds; 
(ii) a nucleic acid fragment encoding all or a portion of a 
soybean seed storage protein placed in sense or 
antisense orientation relative to the promoter of (i); and 
1 0 (iii) a transcriptional termination region; 

(b) creating a transgenic soybean ceil by introducing into a soybean 
cell the chimeric gene of (a); and 

(c) growing the transgenic soybean cells of step (b) under 
conditions that result in expression of the chimeric gene of 

1 5 step (a) 

wherein the quantity of one or more members of a class of soybean seed storage 
protein subunits is reduced when compared to soybeans not containing the 
chimeric gene of step (a). 

2. The method of Claim 1 wherein the soybean seed storage protein is 
20 selected from the group consisting of glycinin and p-conglycinin. 

3. The method of Claim 1 wherein the nucleic acid fragment encoding all 
or a portion of a soybean seed storage protein is placed in sense orientation 
relative to the promoter region. 

4. The method of Claim 1 wherein the nucleic acid fragment encoding all 
25 or a portion of a soybean seed storage protein is placed in antisense orientation 

relative to the promoter region. 

5. The method of Claim 4 wherein the nucieic acid fragment encodes the 
alpha subunit of the p-conglycinin soybean seed storage protein. 

6. The method of Claim 1 wherein at least two members of a class of 

30 soybean seed storage protein subunits are reduced when compared to soybeans not 
containing the chimeric gene of step (a), 

7. A method for simultaneously reducing the expression of two soybean 
genes comprising: 

(a) constructing a chimeric gene comprising: 
35 (i) a nucleic acid fragment encoding a promoter region 

from a soybean seed storage protein gene; and 
(ii) a nucleic acid fragment encoding all or a portion of a 
soybean protein that is not the soybean seed storage 
protein of (i), said nucleic acid fragment placed in sense 
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or antisense orientation relative to the promoter of (i), 
and (iii) a transcriptional termination region; 
(b) creating a transgenic soybean seed by introducing into a soybean 
seed the chimeric gene of (a); and 
5 (c) growing the transgenic soybean seeds of step (b) under 

conditions that result in expression of the chimeric gene of 
step (a) 

wherein the quantity of one or more members of a class of soybean seed storage 

protein subunits and the quantity of the protein encoded by the nucleic acid 
1 0 fragment of (a)(ii) is reduced when compared to soybeans not containing the 

chimeric gene of step (a). 

8, The method of Claim 7 wherein the nucleic acid fragment encoding all 

or a portion of a soybean protein that is not the soybean seed storage protein of 

(a)(i) is placed in sense orientation relative to the promoter region, 
15 9. The method of Claim 7 wherein the nucleic acid fragment encoding all 

or a portion of a soybean protein that is not the soybean seed storage protein of 

(a)(i) is placed in antisense orientation relative to the promoter region. 

1 0. The method of Claim 7 wherein the promoter is derived from the gene 

encoding the alpha subunit of the p-conglycinin soybean seed storage protein. 
20 11. The method of Claim 7 wherein the nucleic acid fragment encoding all 

or a portion of a soybean protein that is not the soybean seed storage protein of 

(a)(i) encodes a gene involved in fatty acid biosynthesis. 

12. The method of Claim 7 wherein quantity of one or more members of a 
class of soybean seed storage protein subunits and the quantity of the protein 

25 encoded by the nucleic acid fragment of (a)(ii) are reduced when compared to 
soybeans not containing the chimeric gene of step (a). 

1 3 . The method of Claim 1 3 wherein at least two members of a class of 
soybean seed storage protein subunits are reduced when compared to soybean 
seeds not containing the chimeric gene of step (a), and wherein the fatty acid 

30 profile of soybean seeds containing the chimeric gene of step (a) is altered when 
compared to soybean seeds not containing the chimeric gene of step (a). 

14. A transgenic soybean plant prepared by the method of Claim 1 . 

15. A transgenic soybean plant prepared by the method of Claim 7. 

16. Transgenic seeds derived from plants of Claim 14. 
35 1 7. Transgenic seeds derived from plants of Claim 15. 

1 8. A transgenic soybean plant wherein the quantity of one or more 
members of a ctass of soybean seed storage protein subunits is reduced in the 
seeds of said plant when compared to seeds derived from a non-transgenic 
soybean plant, 
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19. Transgenic seeds derived from plants of Claim 1 8* 

20. A transgenic soybean plant wherein 

(i) the quantity of one or more members of a class of soybean seed 
storage protein subunits is reduced; and 

(ii) the oleic acid content relative to the content of other fatty acids 
is increased in the seeds of said plant when compared to seeds 
derived from a non-transgenic soybean plant, 

21 . Transgenic seeds derived from plants of Claim 20. 
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Figure 1. 
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Figure 2. 
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Figure 3. 
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Figure 4 
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Figure 5. SDS-PAGE of protein extracted from transformed 
somatic embryos Jo1 -1 and Jo1 -4 
(10% gel, coomassie stained) 
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Figure 6. 
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Figure 7. SDS-PAGE of protein extracted from transformed 
soybean seeds of G94-1 andG94-19 
{10% gel, coomassie stained) 
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Figure a. 
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Figure 9 
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